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DNA Expression (n Transfected Cells And 
Assays Carried Out In Transfected Cells 

This invention relates to methods of expressing DNA in ceils, to vectors for 
expression of DNA in cells and to transfected cells. The invention also relates to 
assays carried out in transfected ceils or differentiated derivatives of such ceils, in 
particular the invention relates to transfection of and expression of DNA in embryonic 
stem (ES) ceils. 

The wealth of sequence information now becoming available from the genome 
projects demands the development of new, high throughput systems for functiona! 
analysis. A powerful route to discovering and characterising genes involved in 
determination and differentiation in mammals is potentially available via the genetic 
manipulation of ES ceils in vitro. 

ES cells, which are derived from the pluripotential inner cell mass (1CM) of the 
preimplantation mouse embryo (2,3), retain the capacity for multilineage differentiation 
both In vitro (4,5) and In vivo (6,7). In principle, therefore, gene products which 
influence developmental decisions should be assayable in ES cell culture systems, 
whatever the source of the cells. However, there are major difficulties in analysing 
cDNA function by ES cell transfection. The frequency of isolating stable transfectants 
is low (<1(T* by eiectroporation, caicium phosphate co-precipitation or lipofection) and 
the great majority of transfectants show heterogeneous and unstable expression. 

These problems are particularly significant in the case of cDNAs whose expression 
causes differentiation because differentiated ES ceil progeny do not generally 
proliferate. In such cases transfectants may still be isolated but transgene expression 
will be minimal. 

Episomal vectors have been used for functional screening in other ceil types in order 
to increase the frequency of stable transfection and to achieve reliable transgene 
expression. However, previously described episcmal vectors, for example based on 
Epstetn-Barr virus (EBV) or bovine papilloma virus (8PV), have limitations both in 
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host cell range and maintenance during long-term culture. 
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A modified extrachromosoma! vector is known based on the replication system of 
murine polyoma virus (8). This plasmid, pMGD20neo, can be stably maintained as 
an episome in ES celts during long term culture. Importantly, the low levels of large 
T protein produced have no overt effect on the growth or differentiation properties of 
the ES cells (8,9). it is also known to maintain simultaneously with pMGD20neo a 
second episomal vector. Expression from the second vector was not possible hence 
pMGD20neo was used for cDNA expression. However, this vector already comprises 
two expression cassettes, one each for large T antigen and the neo selectable marker 
so its size constrains its use for expression of a third cassette containing a cDNA. 

It is an object of the invention to provide a vector for transfection of and expression 
of DNA within a ceil and a method of expressing DNA in a cell that overcomes or at 
least ameliorates the disadvantages identified in the art. An object of at feast the 
preferred embodiments of the invention is to achieve, in a transfected cell, expression 
that is more stable and more homogenous than hitherto attainable. Further objects 
of preferred embodiments of the invention are to provide a method of expressing a 
DNA in an embryonic ceil in a more stable and more homogenous manner than 
hitherto attainable, and to provide for stable transfection of embryonic cells at a 
higher frequency than can be obtained using conventional vectors. 

The invention is based upon the maintenance of a vector within a cell, wherein 
maintenance of the vector is dependant upon the continued presence within the cell 
of a certain factor and wherein that factor is not expressed by the vector but is 
produced in or present in the cell in an amount sufficient to maintain the vector. 

Accordingly the invention provides a transfection and expression method comprising, 
in a cell that expresses or will express a replication factor, introducing a vector 
dependant upon that replication factor. Thus, in a first aspect, the invention provides 
a method of expressing a DNA in a cell, comprising: 

(a) (!) transfecting the cell with a first vector that expresses a 
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replication factor; or 
(ii) otherwise obtaining a cell that expresses or will express the 
replication factor; 

and 

(b) transfecttng the cell with a second vector, wherein 

(i) the second vector contains a DNA, or is adapted to receive a 
DNA, in operative combination with a promoter for expression of 
the DNA; and 

(ii) extrachromosomal repiication of the second vector is dependant 
upon presence within the cell of the replication factor. 

The replication factor is optionally non-toxic to the cell. Alternatively, the replication 
factor is toxic to the ceil at high levels of expression but at low ievels of expression 
is substantially non-toxic to the ceil but at these low levels is present in sufficient 
amount to enable replication of the second vector. 

Further, the replication factor preferably does not alter the ability of the cell to 
differentiate or proliferate, and may thus be regarded as being neutral to the cell 
phenotype. This enables the activities of the product of a cDNA to be investigated 
over a long time period and many cell generations without having to take account of 
possible interfering effects of the replication factor present within the cell. Again, the 
replication factor may be phenotype-neutra! at ail levels or may be neutral at a low 
level which is nevertheless a sufficient level to maintain the second vector within the 
cell. 

The invention is of application to ali cell types for which there exists, whether from a 
natural or synthetic source, a repiication factor capable of maintaining in that cell type 
an episomal vector. The vector is preferably stably maintained, meaning it is 
maintained over a number of cell generations, and at least over 3 generations. The 
cell is preferably selected from the group consisting of mammalian ceils, in particular 
primate ceils or murine cells, and avian cells. It is further preferred that the cell is an 
embryonic cell, in particular an ES, EC {embryonic carcinoma) or EG (embryonic 
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gonadal) cell, or differentiated progeny of any such cell 
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While reference is made to the second vector, it will be appreciated that the 
replication factor is optionally present in the celi other than following transfection with 
a first vector. For example a culture of cells that already express the replication factor 
may be obtainable from a third party. 

in an embodiment of the invention described in detail below, the method comprises 
transfecting an ES cell with a first vector that expresses a viral replication factor, and 
thereafter transfecting the ES cell with a second vector that expresses a cDNA and 
is dependant upon presence of the virai replication factor for its extrachromosomal 
replication within the ES cell. The frequency of the first transfection step is generally 
low and may result in as few as 1 in 10 s successful stable transfectants - this level 
of success is recognised as typical in this art. However, the second transfection has 
surprisingly and advantageously found to result in a significantly higher frequency of 
successfui stable transfectant colonies being obtained. The second transfection can 
be carried out with a 1% or higher success rate, which represents a 100-fold 
improvement over the art. 

One suitable viral replication factor for mouse cells, in particular mouse ES cells, is 
polyoma large T antigen, in which case the cell of step (a) expresses the polyoma 
large T antigen and the second vector comprises an origin of replication that binds 
the polyoma large T antigen, such as the polyoma replication origin, referred to as 
Ori. Another suitable viral replication factor for primate cells is based upon Epstein 
Barr virus, in which the primate eel! of step (a) expresses the EBNA-1 antigen and 
the second vector comprises an origin of replication that binds EBNA-1 , such as QriP. 
Viral replication factors are generally species - specific and so expression of DNA 
according to the invention is dependent upon choice of a replication factor appropriate 
to the cell. Polyoma large T has been described for use in mouse cells. E/JNA-1 is 
suitable for human cells. Still further systems are optionally based on papilloma virus 
replication factors, for human cells, or SV40 virus targe T antigen, for simian cells, 
and further suitable replication factors may also be selected from functional variants, 
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derivatives and analogues of these replication factors, such as temperature sensitive 
variants. 

In use, the second vector is constructed according to standard techniques so as to 
contain a cDNA sequence or insert of interest operativeiy combined with a promoter 
to express the cDNA. The second vector is used to transfect an ES ceil already 
expressing a replication factor and successful transfectants are recovered in which 
it is found that the second vector is stably maintained within the ES ceil and 
expresses the cDNA with a more homogenous pattern than when prior art techniques 
are followed. Thus, the invention provides an advantageous method for expression 
of a cDNA in a cell. 

In this context, "homogenous" in relation to expression of a cDNA in a colony of 
transfected ES celis is used to indicate that most ceils, or a large proportion of ceils, 
or preferably most cells, or more preferably substantially all ceils, express the cDNA 
and "stable" is used to indicate that the celis continue to express the cDNA at a 
similar level and preferably at substantially the same level. In the examples carried 
out to date and described below, homogenous transfection is seen with the method 
of the invention to a greater extent than in the art methods. Also, in the examples 
carried out to date and described below the method results in more stable expression, 
meaning that expression does not alter over time. This has the advantage that study 
of the long term effects of a cDNA product is facilitated. 

It is optional for the cell of step (a) first to be obtained or prepared by transfection of 
a cell by a first vector and for this then to be used for the starting cells for carrying 
out a plurality of separate transfections by second vectors containing different DNA 
- inserts coding for different DNA products of interest Following this procedure, the first 
transfection may be carried out with the level of success typically seen in 
conventional techniques and the ES cells obtained divided into separate colonies. 
The second transfections, introducing the ONA insert in the second vector, are then 
carried out with the higher levels of success typically seen in the methods of the 
invention. 
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in the case that the method comprises transfection with first and second vectors, it 
is preferable for the first vector to code for a selectable marker and for the second 
vector also to code for a selectable marker, though a different one. in a specific 
embodiment of the invention described below, the first vector codes for hygromycin 
resistance and the second codes for neomycin resistance. This allows selection of 
ES cells in which transfection by both first and second vectors has been successful. 

It is a further embodiment of the invention for the method to comprise an additional 
transfection step with a third vector, wherein the third vector contains a cDNA, or is 
adapted to receive a cDNA, in operative combination with a promoter for expression 
of the cDNA, and extrachromosomal replication of the third vector is dependant upon 
presence within the ES cell of the replication factor. Transfection with the third vector 
is optionally at the same time as transfection with the second vector or subsequent 
thereto. 

The second and third vectors preferably each comprise a selectable marker enabling 
selection of ES cells in which transfection has been successful. The respective 
selectable markers are preferably different if the method comprises transfection with 
both second and third vectors, and preferably different again from the selectable 
marker of the first vector. 

it is a feature of particular embodiments of the invention that the second vector (and 
third or subsequent vectors if present) are not able to express the replication factor. 
In fact, in construction of the second vector from a vector comprising DNA encoding 
the replication factor it is preferable for that DNA to be largely or substantially 
completely deleted. 

In a specific embodiment of the invention, the first vector is pMDG20neo and 
expresses polyoma large T antigen and the second vector comprises the natural 
target for polyoma large T antigen, namely Ori, expresses a cDNA of interest but 
does not express large T antigen, in use, the targe T antigen is expressed by the 
first vector and binds to Or/ of the second vector when it enters an ES cell, thus 
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enabling replication of the second vector and its maintenance within the ES cell in an 
extrachromosomal state, in successful transfectants, the vector remains 
extrachrornosomal, and this is believed to render the vector relatively immune from 
effects seen when a vector is integrated into the host ES cell genome, which effect 
may include silencing of the cDNA resulting in unstable and heterogeneous 
expression. 

An alternative to use of the first episoma! vector is to introduce into the ceil a 
construct that expresses the replication factor and integrates with the cell genome. 
The construct should therefore include a DNA sequence coding for the replication 
factor and means for selection of cells in which the construct has successfully 
integrated; one example is a construct that comprises cDNA coding for, in order, large 
T antigen - an internal ribosome entry site (IRES) - Bgeo. A culture of cells is then 
obtained by selecting for ceils that express the selectable marker, such as in this 
case by selection in G418. Staining with Xgal is used to identify transfectant clones 
which show stable and homogenous expression. The construct preferably comprises 
a promoter that gives stable, low level expression in transfected cells, such as the 
HMGCoA promoter for ES cells. The cells obtained can then be subjected to 
transfection with the second and optionally third and subsequent vectors. 

In another embodiment of the invention the second vector comprises an inducible 
promoter. Some types of differentiated cells, derived from ES cells, can only be 
obtained with any reliability if a particular differentiating factor is expressed after a 
prior event. One example is neurone formation which generally only occurs after 
aggregation of cells. Thus, using an inducible promoter, expression of DNA that 
codes for the factor that leads to neurone formation can be controlled until the ES 
cells have suitably aggregated. Interferon responsive promoters are some examples 
of inducible promoters. Alternatively, the cDNA is designed to be in a non-functional 
form and to be capable of being modified into a functional form at a later time. One 
possibility is for the cDNA to be disrupted for example by termination sequences 
which are flanked by target sites for a site specific recombinase, such as loxP sites, 
removable by Cre recombinase, or frt sites removable by Ftp recombinase. Cre and 
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Flp can be fused to steroid hormone receptors in order to make their activity 
reguiatable. After administration of steroid the Cre or Flp recombinase will translocate 
to the nucleus and there convert the cDNA into a functional form by excision of the 
disrupting sequence. It may also be desired to stop or inhibit or reduce replication 
of the second vector; the method optionally comprises using a site specific 
recombinase to present replication of the second vector. This can be achieved by 
deletion of a sequence in the vector to which the replication factor must bind in order 
for the vector to be replicated by the host ceil. 

The term DNA or cDNA is usually understood to refer to a DNA sequence that is 
transcribed into a mRNA that is translated into a polypeptide or protein. In the present 
invention the term is also Intended to encompass any product of DNA expression. 
It thus includes DNA coding for an antisense RNA f or for an antisense ribozyme 
molecule. 

The method of the invention is suitable for assaying effects of DNA expression, due 
to the stability and efficiency of expression achievable. Accordingly, the invention 
further relates to an assay for the effect of presence in a cell of any product of DNA 
expression - such as protein, polypeptide, antisense RNA, ribozyme RNA, transfer 
RNA or other. The method comprises steps (a) and (b) as described above wherein 
the second vector also contains a DNA coding for a selectable marker. The method 
further comprises selecting for cells that have been transfected with the second 
vector and maintaining the selected ceils over a plurality of generations. 

Step (a) may be carried out once and then steps (b) onwards repeated for different 
assays, and the method is of particular application to screening a cDNA library. 
Furthermore, two or more cDNAs can be expressed in the same cell to assay the 
effect of the combination of their respective expression products. 

The invention also relates to a vector. Accordingly, the invention provides, in a 
second aspect, a vector for transfection of an ES cell, wherein: 

(!) the vector contains a DNA, or is adapted to receive a DNA, in operative 
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combination with a promoter for expression of the DNA; 

(ii) extrachromosoma! replication of the vector is dependant upon presence 
within the ES cell of a replication factor; and 

(iii) the vector does not express the replication factor. 

The vector is characterized in preferred embodiments as described above in relation 
to the second vector of the first aspect of the invention. 

It is an advantage of at feast preferred embodiments of the invention that due to very 
high efficiency of stable secondary transfection (supertransfection) of ceils, for 
example transfection of ES celis harbouring pMGD20neo with a second pfasmid 
containing the polyoma replication origin (On) (8), that expression of DNA is stably 
and efficiently achieved from the second plasmid. 

Another aspect of the present invention provides a method of screening for new 
DNAs that encode signal sequences and proteins that are transported to the cell 
surface, The invention according provides a method of investigating the properties 
of a DNA sequence comprising expressing in a cell a composite DNA including (a) 
the DNA sequence under investigation, linked to (b) a DNA coding for a cell active 
protein, wherein 

activity of the cell active protein is dependant upon transport of the cell active 
protein to the cell surface, and 

the DNA of (b) does not code for a polypeptide capable of directing 
transportation of the cell active protein to the cell surface. 

This offers the advantage that where the DNA of interest does indeed code for a 
sequence that transports a polypeptide to the cell surface, whether that polypeptide 
remains there or is ultimately secreted, this wl be apparent from observation that the 
ceil active protein has had or is having its known effect. Thus the method offers a 
convenient means of identifying DNA sequences that will transport proteins to the cell 
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surface. 

The method is suitably used for screening a library of DNAs to identify DNA 
sequences coding for signal polypeptide sequences that transport proteins to the ceil 
surface. The eel! active protein if transported to the ceil surface may remain there 
or be secreted by the ceil, and this distinction may be separately assayed, or 
example by examination of the make-up of the culture medium before and after the 
investigation, 

One convenient way to obtain the DNA of (b) is by deleting or disabling, from a DNA 
encoding a cell surface or secreted protein, that portion of the DNA that codes for the 
polypeptide sequence responsible for transportation of the protein to the celi surface. 
The cell active protein is optionally a celi surface receptor and the DNA of (b) can 
thus encode a modified form of the receptor preprotein lacking a functional signal 
sequence. In a specific embodiment described below the IL-6 receptor is used as 
expression of the receptor in ES cells can be used to inhibit differentiation of the ceils 
- a readily observable property of the cell active protein. Gross morphological or 
proliferative changes induced in the cell by the cell active protein are of course readily 
observed, though the invention is of application to any cell active protein whose 
activity, when it is transported to the celi surface and / or secreted, can be assayed. 

A specific embodiment of this aspect of the invention comprises expressing the 
composite DNA by: 

{a) (i) transfecting a cell with a first vector that expresses a replication 
factor; or 

(ii) otherwise obtaining a cell that expresses the replication factor; 
(b) transfecting the cell with a second vector, wherein 

(i) the second vector contains the composite DNA in operative 
combination with a promoter for expression of the composite 
DNA; 

(ii) the second vector also contains a DNA coding for a selectable 
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marker in operative combination with a promoter for expression 
of the selectable marker; and 
(iti) extrachromosoma! replication of the second vector is dependant 
upon presence within the ceil of the replication factor; 

(c) selecting for cells that have been transfected with the second vector; 
and 

(d) maintaining the selected ceils over a plurality of generations so as to 
assay the effect of expression of the composite DNA. 

if many investigations are to be carried out it is preferred that step (a) is carried out 
once and the cells obtained are divided and used for a plurality of separate methods 
in which steps (b)-(d) are carried out a plurality of times with second vectors 
containing different DNA sequences. This offers the advantage that typically the first 
transfection step is of lower efficiency than the second, so the method avoids having 
to repeat the low efficiency step too often. 

it is particularly preferred that the method is used for identification of a DNA coding 
for a ceil surface or secreted protein, and using the method to screen a library of 
DNAs provides a means of carrying out the screen for discovery of such DNAs and 
investigation of their properties. More especially, the method is for discovery of 
hitherto unknown or uncharacterized cell surface or secreted proteins, or for location 
of the coding sequence of known proteins of this type. 

This aspect of the invention optionally further incorporates in preferred embodiments 
features of transfection of cells described above in relation to other aspects of the 
present invention. 

The invention enables development of a series of vectors which give highly efficient 
and robust expression of transgenes in cells. Cloned cDNAs of interest can rapidly 
be characterised using this system. It is also applicable to the discovery of novel 
regulatory molecules through functional expression screening of cDNA libraries. 
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Due to their pluripotent and proliferative character, key cellular processes such as 
viability, propagation, determination and differentiation, can be analyzed in transfected 
ES cells. The "supertransfection" system of the invention overcomes the limitations 
associated with conventional cDNA transfection and opens a powerful new route to 
gene discovery and characterisation in mammals. 

Key features of the episomal supertransfection system, described according to the 
examples below, are that very high efficiencies of stabie transfection are obtained and 
that cDNA expression is homogeneous, stable and reliably dictated by promoter 
strength. The increased efficiency of isolating stable transfectants is significant 
because it allows reliable detection of cDNAs whose expression results in ceil death 
or differentiation. In addition a high transfection efficiency is generally advantageous 
for any high throughput assay system and is essential for functional cDNA library 
screening. The reliability of cDNA expression is critical for functional studies and the 
robust nature of expression from episomal vectors contrasts favourably with the 
variable and unstable expression observed in conventional ES cell transfectants. 

Heterogeneous expression of integrated transgenes is not an artefact arising from use 
of bacterial iacZ as a reporter gene, firstly because similar observations have been 
made using mammalian thy-1 as a reporter in F9 cells, and secondly because 
ubiquitous expression of iacZ can readiiy be obtained following gene trap integrations 
(23,24). The expression pattern throughout the population cannot be determined by 
Northern blot but can only be revealed by in situ hybridization or use of a linked 
reporter gene such as IRES~/acZ (25) Heterogeneous expression, which previously 
occurred in the great majority of transfected clones following stable integration, gave 
unclear or misleading results on the phenotypic consequences of transgene 
expression. 

The difference in expression pattern between conventional transfectants and episomai 
supertransfectants of the invention arises because an extrachromosomal copy of a 
transgene is not subject to alteration during the integration process nor to modification 
arising from the genomic sequences flanking an integration site. The so-called 
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"position effect" can modify both the level and pattern of transgene expression in 
stable transfectants. Furthermore, the expression of integrated transgenes is often 
suppressed over several generations in ES ceil cultures. This silencing phenomenon 
contributes to the high backgrounds which can be obtained in double replacement 
type targeting strategies (26) . it has been observed in stable transfectants with 
different transgenes driven by viral promoters or minima! mammalian promoters such 
as the widely used human £~actin and mouse PGK-1 promoter elements. One 
hypothesis to explain this phenomenon is that transgenes may become targets of de 
novo methyltransferase in stem cells (27). Macieod et al. (28) reported that a 
methylation free locus could be generated in transgenic mice by introduction of the 
whole CpG island of the aprt promoter. 

Whatever the molecular mechanism of silencing, it appears not to occur to eptsomaliy 
maintained transgenes in vectors of the invention. In addition, the level of expression 
obtained from vectors of the invention is reliably dictated by promoter strength and 
can predictably be varied over at least a 10-fold range by appropriate choice of 
promoter. Episoma! constructs of the invention thus offer considerable advantages 
for functional expression studies in ES cells. 

Functional cDNA expression cloning is a powerful method for direct isolation of 
important genes. The expression screening approach has often been employed to 
isolate cDNAs encoding surface and secreted molecules via transient expression, for 
example in COS cells. In a few cases EBV-based systems have also been applied 
to isolate intracellular regulatory genes via stable expression in the target cells (29- 
32} , The high efficiency of supertransfection in the polyoma system of the invention 
indicates that this approach could be applied to functional cloning in ES ceils. Based 
on a transfection efficiency of 2.5%, a library of 5x1 0 5 cDNA clones could be 
screened by electro poration of 2x1 0 7 ceils with 10G//g DNA. For an effective library 
screen, the majority of transfectants should only take up a single plasmid. It is also 
advantageous if the cDNAs can readily be recovered in unrearranged form. Both of 
these conditions are satisfied by the episomal supertransfection system. By 
screening libraries prepared from undifferentiated ES cells it may be possible to 
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isolate cDNAs whose products mediate self-renewal. In this case direct selection can 
be applied for coiony formation in the absence of LIF. For cDNAs whose products 
direct differentiation, however, it will be necessary either to screen pools through 
several rounds or to incorporate an inducible promoter into the episome. 

Recently, several improved protocols for in vitro differentiation of ES cells have been 
reported, which promote efficient generation of, for example, haematopoietic cells (33) 
, neurons (34) or cardiomyocytes (35). The episomai expression strategy of the 
invention can be applied for gatn-of-function assays and screens during these 
differentiation programmes. It can also be used for !oss-of-function analyses via 
overexpression of anti-sense RNA or dominant-negative mutants. Combination of 
these differentiation systems with the episomai expression system will therefore 
provide powerful tools for analysing ceil determination and differentiation events. 

The invention is now described with reference to the accompanying drawings in 
which: 

Fig. 1 shows the structure of the episomai expression vector pHPCAG; 
Fig. 2 shows supertransfection efficiency of pHPCAG in MG1.19 ES ceils; 
Fig. 3 shows DNA hybridisation analysis of Hirt supernatants from 
supertransfectants; 

Fig. 4 shows the effect of vector size on supertransfection efficiency; 
Fig. 5 shows expression of /?-galactosidase in MG1.19 transfectants; 
Fig. 6 shows the restriction pattern of piasmid DNAs recovered from 
pHPCAG-/acZ supertransfectant clone; 

Fig. 7 shows induction of differentiation by expression of STAT3F in MG 
1.19 ES cells; 

Fig. 8 shows co-supertransfection of STAT3F with wild type STAT 
expression vectors; 

Fig. 9 shows linker sequences for use in an assay of the invention; 

Fig. 10 shows DNA sequences coding for truncated and modified !L6R; and 

Fig. 11 shows a vector for use in an assay of the invention, 
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in more detail; 

Figure 1 shows the structure of the episomai expression vector pHPCAG. cDNAs can 
be introduced between two BstX\ sites using SsfX! adaptors. Abbreviations: ALT20: 
deleted polyoma large T expression cassette LT20; Pyori/enh: mouse polyoma virus 
replication origin and mouse polyoma mutant enhancer derived from F101 strain; 
SVpA; SV40 potyA addition signal; PGKhphpA: hygromycin B phosphotransferase 
gene expression cassette with mouse phosphogiycerokinase-1 (PGK) promoter and 
polyA addition signal; GAG: combined CAG expression unit; /?-giobinpA: rabbit fi- 
globin polyA addition signal; SVori: SV40 replication origin; CoElori: CofE1 
replication origin; amp: B.coli ^-lactamase gene conferring resistance to ampiciilin. 

Figure 2 shows supertransfection efficiency of pHPCAG in MG1.19 ES cells. 

(A) shows numbers of transfectant colonies per microgram of pHPCAG DNA. 5x1 0 6 
MG1.19 ES celts were supertransfected with the indicated amounts of supercoiled 
pHPCAG followed by selection with hygromycin B for 8 days. The resulting number 
of drug-resistant colonies were scored and efficiency per //g DNA calculated. 

(B) shows total numbers of transfectant colonies plotted against total amount of 
plasmid DNA. 

Figure 3 shows DNA hybridisation analysis of Hirt supernatants from 
supertransfectants. Hirt supernatants were prepared from 5x1 0 6 parental MG1.19 
cells and pooled pHPCAG supertransfectants. 1/20 of each sample was digested 
with either Eco R! or Mncflll and analyzed by filter hybridisation using a 344bp Sea 
l-Sspl fragment from pUC19 which is common to both pMGD20neo and pHPCAG. 

Figure 4 shows the effect of vector size on supertransfection efficiency. 20>g of each 
of the supercoiled vectors pLT20ANde\hph (4.7), pLT2GABs£Xinp/7 (5.5), 
P LT20AA7wMnpn (5.6), pLT20ASadfcp/7 (5.9), ptkp (6.2), pSV40e/p (6.4), 
PGKnpnALT20 (6.5), pmPGKp (6.6), phBAp (6.6), pHPCAG (7.7), ptkp-/acZ (8.9), 



WO 98/32868 



- 16 



PCT/GB98/00216 



pSV40e/p~/acZ(9,1), pmPGKp-/acZ(9.3), phBAp-/acZ (9.3), and pHPCAG-/acZ(10.4) 
were individually supertransfected into 5x1 0 6 MG1.19 ES cells. The resulting 
numbers of hygromycin B resistant colonies were scored after 8 days. Transfection 
efficiencies are normalised relative PGKhphALT20. 

Figure 5 shows expression of jff-galactostdase in MG1.19 transfectants. Primary 
colonies were stained with Xgal after 8 days of selection. 

(A) shows typical homogeneous staining pattern obtained following supertransfection 
with supercoiled pHPCAG-/acZ. 

(B) shows heterogeneous staining pattern obtained in minority of clones following 
supertransfection with supercoiled pHPCAG-/acZ. 

(C) shows heterogeneous staining pattern typically observed following electroporation 
of linearized pHPCAG-/acZ and stable integration. 

(D) shows rare faint staining pattern obtained after supertransfection with supercoiled 
pH PC AG-lacZ. 

Figure 6 shows the restriction pattern of plasmid DNAs recovered from pHPCAG-/acZ 
supertransfectant done. 

A supertransfectant MG1.19 clone carrying pHPCAG-/acZ was cultured for 60 days 
in the presence of hygromycin B. Hirt DNA was then prepared and 
electrotransformed into Ecoii DH10B cells. Plasmid DNAs were recovered from 
transformants, digested with EcoRl, resolved by electrophoresis on 1.0% agarose gel 
and visualised byethidium bromide staining. Expected fragment sizes: pMGD20neo, 
4852bp and 2884bp; pHHPCAG-/acZ, 3697bp s 2810bp, 783bp and 397bp. Lane 1: 
size marker (1kb laddenBRL); lane 2: control pMGD20; lane 3 ; control pHPCAG- 
lacZ; lane 4: recovered pMGD20; lane 5,6; recovered pHPCAG-/acZ. 
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Figure 7 shows induction of differentiation by expression of STAT3F in MG 1.19 ES 
ceils. 

(A) shows proportion of differentiated colonies in LIF-supplemented medium resulting 
from supertransfection of STAT3, antisense STAT3 and STAT3F expression vectors. 
Colonies were fixed and stained with Irishman's reagent after 8 days selection and 
numbers of stem cell colonies and differentiated colonies scored. 

(B) shows marker gene expression in STAT3F supertransfectants: Expression of 
marker genes in pools of MG1.19 celis supertransfected with STAT3 (lane 1), STAT3 
antisense (iane 2) and STAT3F (iane 3} expression vectors. Total RNA was prepared 
after 8 days of selection in LIF-supplemented medium and 5>g aliquots analyzed by 
filter hybridisation with /j-globin, Rex-1, H19 and G3PDH probes. The /?-globin probe 
detects al! transgene mRNA species generated from pHPCAG, including an 
alternatively spliced product from the antisense construct. 

(C) shows photomicrographs of representative colonies 8 days after supertransfection 
with (i) STAT3, (ii) STAT3F, and (iti) empty expression vectors and selection in the 
presence of LIF, or, (iv) induction of differentiation by culture in the absence of LiF 
for 8 days. 

Figure 8 shows co-supertransfection of STAT3F with wild type STAT expression 
vectors. Proportions of undifferentiated stem celi colonies generated after co- 
supertransfection of MG1.19 ES cells with 10pg pBPCAGGS-STATSF plus 10/ig 
pHPCAG vector containing stuffier (control), STAT3, STAT1 or STAT4 inserts. After 
8 days selection with 80/jg/ml of hygromycin B plus 20//g/ml of blasticidin S, colonies 
were fixed and stained with Leishman's reagent. 
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EXAMPLE 1 
Materials and Methods 
Vector constructions. 

Standard recombinant DNA methods were used to construct all plasmids(10) . 
Plasmid pHPCAG (Fig 1) was constructed from pMGD20neo(8) . The PGKneopolyA 
sequence was replaced by a hygromycin resistance marker, PGKhphpA, and large 
T sequences were deleted (see Results). A Sa!\-Sca\ fragment containing the CAG 
expression unit, a BsfXl stuff er sequence, the poiyA addition signal derived from the 
rabbit /?~globin gene and an SV40 replication origin (11) was inserted. Coding 
sequences for /?-galactostdase, LIF or interleukin-2 were introduced between the 
BstXI sites. 

For construction of episomai expression vectors with alternative promoters, the Sa/i- 
Xbal fragment containing the CAG expression unit in pHPCAG-tecZ was replaced with 
the 344 bp SV40 enhancer/promoter (SV40e/p), the 466 bp human yff-actin promoter 
(hBA), the 502 bp mouse phosphoglycerate kinase promoter (mPGK) and the 90 bp 
HSV-tk minima! promoter (tk), resulting in pHPSV40e/p-/acZ, pHPhBA-facZ, 
pHPmPGK-/acZ and pHPtk-/acZ, respectively. 

Episomai vectors with alternative selection markers were constructed by replacing the 
PGKhphpA cassette in pHPCAG with the SVbsrpA cassette carrying the E.coii 
blasticidin S deaminase (bsr) gene derived from pSV2bsr (Waken Seiyaku) or the 
hCMVzeopA cassette carrying the Streptoalloteichus bleomycin resistant gene (Sh 
bie) derived from pZeoSV (Invitrogen) to generate pBPCAGGS and pZPCAGGS, 
respectively. 

Cell culture and transfeciion. 

MG1.19 ES cells are derivatives of the CCE line which stably maintain around 20 
episomai copies of pMGDneo(8) . They were maintained on gelatin-coated plates in 
Glasgow modified Eagle's medium (GMEM, Gibco-BRL) supplemented with 10% fetal 
calf serum, 0.1 mM /?-mercaptoethanol, non-essential amino acids, 200 /ig/ml G418, 
and 100U/ml LIF produced in COS-7 cells(11,12) . For superinfection, routinely, 
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5x1 0 6 MG1.19 celis were suspended in 800 //I of PBS, incubated with 20 //g of 
supercotled vector DNA for 10 min on ice, and eiectroporated at 200V/960yc/F using 
a Bio-Rad gene pulser. Ceils were transferred into gelatinized plates and allowed to 
recover overnight before addition of appropriate selection agent. Histochemical 
staining for jff-galactostdase was carried out with 5-bromo-4-diioro-3-indolyl 0-U~ 
galactopyranoside (X-gal) (13) , and yS-galactosidase activity was measured by 
incubation of cell extracts with o-nitrophenyi-yS-D-galactopyranoside (ONPG). 
Differentiation was induced in monolayer culture as described (12) . 

Analysis of episomal vectors in the supertransfectants. 

Hirt supernatants were prepared as described (14) . For amplification of recovered 
episomal vectors, eiectrocompetent £ colt DH108 ceils were transformed by 
electroporation at 25Q0V/25>F/200 , / a . 

Results 

Construction of an episomal expression vector. 

Polyoma-based plasmids have recently been reported to be competent for episomal 
propagation in ES cells (8) . The plasmid pMGD20neo contains a modified large T 
expression unit called LT20, the viral origin of repiication (On), and the PGKneopA 
cassette as a selectable marker. This piasmid can be maintained as an 
extrachromosomai element in wild-type ES cells. It can be modified to include a 
cDNA expression unit (9) . However, the low frequency of conventional stable 
transfection of ES cells (A 1x10*) remains a limiting feature. Furthermore, episomal 
propagation only occurs in 10-15% of primary transfectants (8,9) . 

A second plasmid has been described which can be maintained as an episome only 
in ES cells which independently" express the large T protein (8) . This plasmid, 
PGKhphALT20, contains LT20 with a large deletion in its coding sequence, On, and 
PGKhpftpA as a selectable marker. When introduced into a cell line such as MG1.19, 
in which episomal maintenance of pMGDneo has already been established, the yield 
of hygromycin B resistant stable transfectants is extremely high. This phenomenon 
of supertransfection is presumed to arise from the pre-existence of large T protein in 
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the recipient cells. 

in the studies reported below the modification and use of supertransfection vectors 
for cDNA expression is characterised. 

Size of vector 

PGKf?pr?ALT20 retains part of the large T coding sequence. We made a series of 
deletions in the ALT20 sequence to minimize the vector size and thereby increase 
the capacity for inserts and reduce potential bias in the construction and screening 
of cDNA libraries. The supertransfection efficiency of four derivative plasmids was 
then compared in MG1.19 cells. All showed comparable supertransfection efficiency 
to PGK/ipMLT20 (data not shown). The smallest, pLT20A/Vdelr)p/7, has a deletion 
of 2953 bp, yielding an episomal vector backbone of only 4.7kb. 

Expression unit 

Into this minimal episomal vector we introduced a cDNA expression unit. 
Transcriptional initiation signals are supplied by the CAG cassette{11) , which 
comprises the human cytomegalovirus immediate eariy enhancer, a 1Kb fragment of 
the chicken £-actin gene (promoter, non-coding first exon and first intron), and a 
splice acceptor derived from the rabbit #-g!obin gene. This combination has been 
shown to direct strong expression of cDNAs in undifferentiated stem cells. The 
resulting expression vector, pHPCAG (Fig 1), contains the CAG sequences followed 
by the SsfXi stuffer sequence derived from pCDMS as a cDNA cloning site, and a 
polyA addition signal derived from the rabbit jff-globin gene. In addition the plasmid 
contains the PGKhphpA (15) cassette for hygromycin selection of ES ceil 
transfectants, the poiyomaOr/with pyF 101 -derived mutant enhancer element (16) for 
stable episomal replication in ceils expressing polyoma large T protein, and the 0- 
lactamase (amp) gene and prokaryotic replication origin for amplification in £. colL 
The SV40 Or/ is also present to allow for transient episomal replication in mammalian 
host cells expressing SV40 largeT, such as COS cells (17) . 
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Characterization of supertransfection. 

The parameters of supertransfection with pHPCAG and derivatives were investigated. 
First, 5x10 s MG1.19 cells were electroporated with various amount of supercoiied 
pHPCAG, selected in medium containing 80 //g/ml of hygromycin B for 8 days, and 
the number of stem ceil colonies scored after Leishman's staining(12) . Although the 
highest efficiency per //g DNA was observed with minimum amounts (1-2 mQ) of 
vector DNA (Fig. 2B), the total yield of hygromycin B resistant colonies increased with 
increasing amount of plasmid (Fig 2A), Saturation was not reached over the range 
of plasmid concentrations tested. With 100 //g piasmid DNA, 150,000 hygromycin B- 
resistant colonies were obtained, representing 3% of total treated ceils. Disablement 
for episomal replication by linearisation of pHPCAG prior to electroporation reduced 
this transfection efficiency to less than 0.01%. 

Next, increasing numbers of MG 1.19 cells were subjected to electroporation with 1 00 
fjg of pHPCAG DNA. Comparable stable transfection efficiencies in the range 3-6% 
were obtained with up to 2.5x1 0 7 cells. 

The copy number of pHPCAG in the supertransfectants was analyzed by preparation 
of Hirt supernatants followed by filter hybridisation. This analysis revealed that 
supertransfected ceils carried approximately 20 copies each of pMGDneo and 
pHPCAG (Fig. 3). 

These data demonstrate that the efficiency of supertransfection with pHPCAG is 
extremely high. However, episomal vectors can be limited in their capacity for inserts 
because increased size may cause inefficient replication or instability. To investigate 
this issue in the ES cell system, episomal vectors of different size were 
supertransfected into MG 1 .1 9 cells. The numbers of supertransfectant colonies were 
scored and plotted against vector size (Fig. 4). These data indicate that there is a 
progressive reduction in transfection efficiency with increasing plasmid size, in 
particular, the largest plasmid tested, a derivative of pHPCAG with a 3kb lad insert 
(total size 10.4kb) showed a 50% reduction in colony number. However, that this 
may not be due entirely to the size of the piasmid because the very high levels of fi~ 
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galactosidase expression may exert some toxic effects (see below). 



lacZ expression in supertransfectants, 

To evaluate the levei and pattern of expression of transgenes from pHPCAG, the 
E.coli /?-galactosidase (/acZ) gene was introduced into this vector. The resulting 
vector, pHPCAG-/acZ, was introduced into MG1.19 ceiis and supertransfectants 
isolated by selection with 80 //g/m! of hygromycin 8 for 8 days. The number of 
colonies isolated was 50% of the number obtained in a parallel supertransfection with 
pHPCAG (see above). The colonies were smaller and many of the ceils showed an 
abnormal spindle-shaped morphology. These effects were not observed with several 
other inserts in pHPCAG and are suggestive of a toxic effect of the high levei lacZ 
expression. The primary supertransfectants were stained with X-gai and the staining 
pattern examined under phase-contrast microscopy. Staining was detectable after 
5 minutes incubation and was intense by 1 hour. This ievei of £-galactosidase 
activity is significantly higher than we have observed from a variety of integrated 
expression constructs. 

Approximately 80% of supertransfectant colonies showed ubiquitous expression 
(>90% celi positive) as shown in Fig.5-A (i). Of the remainder, 15% showed 
heterogeneous expression (Fig.5-A (ii)), and 5% showed little or no staining (Fig.5-A 
(iv)). The latter two classes are likely to arise as a result of vector integration which 
occurs in up to 20% of supertransfectants (8). In transfectants derived by 
eiectroporation of linearized pHPCAG-fecZ into MG1.19 cells {which results in vector 
integration in the majority of clones), only 15 % of colonies showed homogeneous 
staining whereas 70% of colonies stained heterogeneously (Fig.5-A (iii)), and 15% 
showed no expression. 

Analysis of expanded clones from each class of transfectant established that this 
difference in expression characteristics was stable. Twelve of 13 expanded 
supertransfectants expressed lacZ homogeneously, in contrast, only 4 out of 24 
clones derived using linearized vector showed homogeneous expression. This is 
consistent with our previous observations on integrated expression constructs in ES 
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ceils, fn fact the CAG unit gives a significantly higher frequency of colonies which 
show stable ubiquitous expression than other promoters we have examined. 

The difference in staining pattern between episomaily maintained and integrated 
vectors indicates that the former escape modifying influences arising from integration 
and reliably give full activity of the expression unit. 

Comparison of expression with various promoters on episomai vector. 

An ability reliably to generate predetermined levels of expression wouid be a 
important attribute for a transgene expression system. The previous observations 
suggested that episomai vectors offered potential to achieve unmodified expression. 
Various promoters with different strengths in undifferentiated stem ceils were 
therefore introduced into the episomai vector by replacing the CAG expression unit 
of pHPCAG-/acZ. Expression of the iacZ reporter was then assayed in both transient 
and stable supertransfectants (Table 1). The relative ratio of /?-gaiactosidase activity 
obtained from the SV40 enhancer/promoter complex, the human jG-actin promoter, 
the mouse PGK-1 promoter and the HSV-tk minimal promoter in transient transfectant 
was retained In stable supertransfectants. The CAG expression unit showed 
strongest activity in the tested constructs in both transient and stable transfectants. 
In this case, however, the relative ratio in transient transfectants, 19 times higher than 
SV40, was significantly reduced in stable transfectants. This may arise from an 
elimination of strong expressants due to a toxic effect of high IacZ expression (see 
above). A reduced number of supertransfectants and smaller size of colonies was 
observed only with the CAG vector. 

Stability of supertransfected episomai expression vector during long-term 
culture and differentiation of host cells. 

A critical limitation of previously described episomai vectors is their instability during 
long-term culture. Many episomai vectors undergo integration into the host genome 
after long-term culture, resulting in a reduction in expression and inability to recover 
transgenes by preparing Htrt supernatants. To test the stability of the 
supertransfection system, four pHPCAG-/acZ supertransfectant clones were cultured 
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for 60 days (approximately 90 generations) under continuous selection with 80 //g/mi 
of hygromycin B. Three of the four clones maintained relatively constant levels of 0- 
galactosidase activity determined by ONPG assay and uniform expression as 
revealed by Xgal staining. The fourth clone showed unstable and variegated 
expression, as commonly observed on vector integration. Hirt supematants were 
prepared from one of the stably expressing clones at the end of the 60 day culture 
period. Filter hybridization analysis of the Hirt DNA indicated that the ES cells carried 
approximately 20 copies of pMGD20 and 5 copies of pHPCAG-/acZ per cell (data not 
shown). The lower copy number of pHPCAG~/acZ may be due to its larger size 
and/or the toxic effect of strong iacZ expression. The Hirt DNA was transformed into 
Ecoii for further analysis. Of the bacterial transformants, 20% carried pHPCAG-ZaeZ 
and the remainder carried pMGDr>eo2G, in good agreement with the hybridization 
data. Restriction mapping showed no evidence of rearrangement in either plasmid 
(Figure 6). 

In the experiment above, cells were maintained under selection with hygromycin B. 
in the absence of selection pressure, supertransfectant clones lost expression of /?- 
gaiactosidase over several passages in culture. This might indicate an intrinsic 
instability of supertransfected episomal vectors. However, it could aiso reflect a 
selective disadvantage for ES ceils which express high levels of 0-gaIactosidase. It 
is noteworthy in this regard that the primary episome, pMGD20neo, is stable in the 
absence of sefection(8) . 

Stability of expression from pHPCAG-/acZ during the in vitro differentiation of ES cells 
was also analyzed. Differentiation was induced in three ways: withdrawal of LIF; 
exposure to retinoic acid; and treatment with 3-methoxybenzamide(18) . After 6 days 
the differentiated progeny stained ubiquitously in all three cases (data not shown). 

These data indicate that supertransfected episoma! vectors can be maintained in an 
extrachromosomal state and direct strong expression of transgenes during long-term 
self-renewal and differentiation in vitro. 
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Production and secretion of the cytokine LIF from an episomal ES ceil 
expression vector. 

The pHPCAG-/acZ plasmid can efficiently direct strong and homogeneous expression 
of the cytoplasmic lacZ reporter gene. We next investigated expression of a secreted 
molecule, the cytokine LIF. LIF is an essential supplement to ES cell culture medium 
because it inhibits differentiation of the stem ceils (19,20) . Expression of LIF can 
readily be assayed by formation of stem cell colonies in media lacking the cytokine. 

Episomal vectors for expression of another cytokine, inteheukin-2 (which has no 
effect on ES cell phenotype), and for LIF were electroporated in parallel into MG1.19 
cells. The cells were seeded at low density (1 .5x1 0 4 and 5x1 0 3 ceils per 90mm plate) 
to avoid the rescue effect which arises from the production of LIF by differentiated ES 
cell progeny (21 ) , and cultured with 80 ;/g/ml of hygromycin B for 8 days. pHPCAG- 
U2 generated large numbers of stem ceil colonies in medium supplemented with LIF, 
but none in the absence of LIF. pHPCAG-//? in contrast produced comparable 
numbers of healthy stem cell colonies in both the presence and absence of 
exogenous LIF (Table 2). These colonies could be expanded and propagated without 
LlF-supplementation of the medium. These data confirm previous observations that 
increased autocrine expression of LIF renders ES cells factor-independent (22) and 
establish that secreted proteins are produced efficiently and stably by this episomal 
expression system. 

Co-supertransfection of episomal vectors. 

Introduction of two or more different transgenes into cells is often required for 
analysis of protein interactions and/or co-operative function. The poor efficiency of 
homogeneous expression in conventional transfectants is a major obstacle for such 
investigations in ES cells. To test the possibility that the episomal approach could 
be applied to co-express multiple cDNAs, we constructed episomal expression 
vectors with different selection markers. Co-supertransfection of. episomal vectors 
was then assessed. 



The basic episomal expression vector pHPCAG carries the hygromycin 
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phosphotransferase gene driven by mouse PGK-1 promoter (PGK/jpftpA). We 
prepared episomal vectors which carry the zeocin-resistance gene driven by the 
human cytomegalovirus immediate-early promoter (pZPCAG), or the blasticidin S- 
resistance gene driven by the SV40 enhancer/promoter (pBPCAG) by substitution 
of the PGKripnpA cassette in pHPCAG. These vectors were supertransfected into 
MG1.19 cells followed by 8 days selection with the appropriate antibiotic. 
Comparison of the numbers of resulting drug-resistant coionies (Table 3) revealed 
that these selection systems are slightly less efficient than hygromycin B selection but 
nonetheless enable large numbers of supertransfectants to be isolated, 

ES cells harbouring two different episomal vectors can be isolated by repeated 
supertransfection. Supertransfectants carrying pHPCAG can be transfected again 
with pBPCAG or pZPCAG, with comparable efficiency to the original supertransfection 
into MG1.19 ES cells (data not shown). This should allow establishment of efficient 
screens for assaying functional interactions between gene products. 

The effects of co-electoporation of supertransfection vectors were also investigated. 
pHPCAG (10 pg) and pBPCAG (10 pg) were co-electroporated into 5x1 0 a MG1.19 
ceils. Cells were selected in hygromycin B or blasticidin S only, or both, for 8 days 
and the number of drug-resistant colonies scored in each case. The numbers of 
hygromycin or blasticidin S single-resistant colonies were 39,000 and 13,000, 
respectively, while the number of double-resistant colonies was 1,200. Thus the 
apparent efficiency of incorporation of both plasmids was less than 10%. Similar 
results were obtained on co-supertransfection of pHPCAG and pZPCAG (not shown). 
These data suggest that the majority of supertransfectants incorporate only one 
plasmid under these electroporation conditions. This is significant for application of 
the episomal system to functional cDNA library screening. 

EXAMPLE 2 

The effects of overexpression of a large number of transgenes in ES cells were 
investigated by construction of vectors based on pHPCAG and including a DNA insert 
coding for the transgene being investigated. 5 x 10 s ES MG1.19 cells were 
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supertransfected with 20 //g of expression vectors and selected with 80 //g/mt of 
hygromycin B for 8 days. The numbers of drug-resistant colonies were counted and 
normalised relative to numbers obtained with empty vector. The results are shown in 
Table 4. 

EXAMPLE 3 

Inhibition of STAT3 activation blocks seif-renewai and promotes differe ntiation 

To assess directly the requirement for STAT3 activation in ES cell self-renewal, we 
exploited a dominant interfering mutant form of STAT3, STAT3F. In this mutant 
(Minami et a/., 1996), the tyrosine residue at amino acid position 705 is mutated to 
phenylalanine. Phosphorylation of Tyr705 is required for dimerization and nuclear 
translocation. When expressed at high level, STAT3F has been shown to block the 
activation of endogenous STAT3 in various ceil types, possibly by titrating out 
receptor docking sites (Fukada et aL, 1996; Minami et a/., 1996; Nakajima ef a/., 
1996; Bonni ef a/., 1997; lhara et a/., 1997). 

Using conventional transfection approaches we were unable to recover ES cell 
transfectants showing stable high level expression of STAT3F. In parallel 
experiments, however, transfection of the LIF-independent embryonal carcinoma cell 
line P19 yielded multiple expressing clones. This suggested that blockade of STATS 
activation in ES cells specifically resulted in cell death, growth arrest or differentiation. 
The transfection and expression strategy of the invention was therefore adopted to 
enable characterisation of the consequences of STAT3F expression. 

The STAT3F mutant cDNA was introduced into the supertransfection vector pHPCAG. 
The wild type STAT3 coding sequence was also introduced, in both sense and 
antisense orientations. The three constructs were electroporated into MG1.19 ceils 
which harbour a targe T expression plasmid and can be supertransfected with 
constructs containing the polyoma origin {Gassmann etal., 1995). Supertransfectants 
were isolated by selection in hygromycin B for 8 days in the presence of LIF. 
Colonies were fixed, stained with Irishman's reagent, counted, and scored for the 
presence of stem cells and differentiated cells. More than 95% of colonies obtained 
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following supertra refaction with controi or wild type STAT3 vector were stem eel! 
colonies {Figure 7A). A modest increase in the proportion of differentiated colonies 
was obtained with the antisense construct. The STAT3F vector, however, yielded 
predominantly differentiated colonies. A decrease in total number of coionies was also 
observed after supertransfection with STAT3F, This may reflect an early onset of 
differentiation which would produce very small clones that would not be scored. 
Alternatively, very high levels of STAT3F expression may also be toxic, though this 
has not been reported in other cell types. Morphologically, the differentiated STAT3F 
colonies closely resembled the differentiated colonies generated on culture of ES 
cells in the absence of UF (Figure 7C). Various other cDNAs have been expressed 
in ES cells using this system, with little or no effect on differentiation {data not 
shown). This suggested that the effect on differentiation was specifically attributable 
to expression of STAT3F. 

The differentiation induced by expression of STAT3F was examined further by 
expression analysis of the marker genes rexl and H19. Rex-1 mRNA, which is 
specifically expressed in undifferentiated stem cells, was down regulated in STAT3F 
supertransfectants. In contrast, H19 RNA which is found at low levels in stem cells 
but is upregulated during differentiation, was increased (Figure 7B). A similar pattern 
of gene regulation is observed during differentiation of ES cells induced by withdrawal 
of LIF. These data confirm that the morphological differentiation triggered by STAT3F 
is accompanied by reprogramming of gene expression. 

STAT3F was also expressed from the mouse phosphoglycerate kinase (pgk-1) 
promoter in the episomal vector pHPPGK. This vector gives at least 10-fold lower 
expression than pHPCAG (data not shown), in this case, there was no significant 
effect on either colony number or differentiation status of MG1 .19 supertransfectants. 
A critical level of expression of the dominant interfering mutant therefore appears 
necessary to block self-renewal. 
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Effect of STAT3F on self-renew al is suppressed by co-expression of STAT3 

To test whether the induction of differentiation by expression of STATS F was due to 
an inhibition of endogenous STATS activity, we attempted to rescue the stem ceil 
phenotype by co-expression of wild type STATS and also of STAT1 and STAT4. A 
STAT3F expression vector carrying a blasticidin resistance marker was co- 
supertransfected into MG1.19 ceils with episomal constructs for expression of wild 
type STATs and hygromycin resistance. Co-supertransfectants were isolated in 
medium containing both 20/jg/ml of blasticidin S and 80//g/m! of hygromycin B. The 
numbers of stem cell and differentiated coionies were scored after 8 days. As shown 
in Figure 8, only co-expression of wild type STAT3 restored self-renewal in the 
presence of STAT3F. Transfection with STAT1 or STAT4 constructs alone had no 
effect on self-renewal in the absence of STAT3F (not shown) and did not alter 
differentiation induced by STAT3F. in the case of supertransfection with the CAG 
promoter STAT1 construct, the total number of colonies (stem + differentiated) 
recovered was reduced but the relative proportion of stem cell colonies versus 
differentiated cells was unaltered. This occurred in both the presence and absence 
of co-expression of STAT3F, and suggests that high level expression of STAT1 may 
be toxic to ES cells. By using the mouse PGK-1 promoter to drive lower levels of 
expression comparable numbers of colonies were recovered on transfection with the 
STAT1 as with the other constructs. In this case, again only the STATS construct 
showed any restoration of stem cell colonies, although to a lower degree than with 
the high expression CAG vector (not shown). These data indicate that STAT3 has a 
specific function in ES ceils which cannot be compensated by STAT1 or STAT4. 

EXAMPLE 4 

The invention is aiso used in a strategy for direct selection of genes that code for 
secreted and eel! surface proteins, in one example of this strategy, the basic cloning 
vector is a truncated form of 1L6R that lacks a signal sequence. This vector is 
described in detail below and shown in Fig. 11. if this truncated 1L6R is expressed 
in ES cells, it is not exported to the ceil surface and these celts differentiate when 
cultured in IL6. However, if the IL6R signal sequence is reconstituted by a signal 
sequence provided by a cDNA fragments cloned in frame at the 5' end of the 
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truncated IL6R, the chimaeric receptor is expressed on the surface of ES cells. ES 
cells containing such chimaeric receptors are thus maintained as undifferentiated 
colonies when cultured in !L6. 

Libraries of short, 5' cDNA fragments are produced and cloned into a truncated and 
modified !L6R-based expression vector. ES cells transformed with such libraries 
express cDNA:IL6R fusion proteins. However, oniy cDNAs that encode signal 
sequences confer IL6 responsiveness on ES cells. These cDNAs alone give rise to 
undifferentiated, proliferating ES eel! clones. This strategy therefore provides a direct 
selection for cDNAs encoding secreted and cell surface proteins. 

The chimaeric !L6R is expressed in the episomal expression system described above 
(or a derivative thereof). This allows drug selection for episomaliy transformed cells 
and high level expression of cloned DNA. 

To further refine the selection system, ES cells are modified with two targeted 
mutations: 

a) A selectable marker gene, for example the blasticidin resistance gene, is 
introduced into the OCT-4 locus by standard targeting techniques. Since Oct-4 is 
expressed in undifferentiated ES cells, the blasticidin resistance gene will be 
expressed only by undifferentiated colonies. Blasticidin selection therefore is used to 
decrease background growth by ensuring rapid deletion of differentiating, Oct-4 
negative, ES cells. 

b) Since ES cells can produce LIF as an autocrine growth factor, ES cells are used 
in which both copies of the UFR gene have been disrupted by gene targeting. This 
eliminates the possibility of LIF-dependent, false positive colonies that might 
otherwise persist throughout selection in IL6. 



Details of vec tor construction: 

1). 1L6R was cloned into the episomal vector pCAGlP or a derivative (pCAGIPXN, 
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i.e. pCAGIP with a destroyed Not! site), pCAGIP contains an internal ribosome entry 
site (IRES) and a puromycin resistance gene downstream of its multiple cfoning site, 
resulting in stoichiometric production of cDNA:iL6R fusion proteins in transfected cells 
under puromycin selection. !L6R in pCAGiP provides a positive control (IL6- 
responsive functionai protein on the cell surface), and the basis of the new vector. 

2) . To construct the cloning vector, IL6R cDNA was truncated by cleavage with 
BssHli at nucleotide number 92. This deleted the initiator ATG and sequences 
encoding the signal sequence. 

3) . To minimise potential steric interference by cloned proteins with !L6 binding and 
IL6R function, DNA encoding a synthetic flexible linker peptide was then added to the 
5' end of the truncated 1L6R. Two alternative Sinkers have been used: gly giy gly gly 
ser gly gly giy giy ser and a linker containing the FLAG epitope, gly ser ASP TYR 
LYS ASP ASP ASP ASP LYS (FLAG epitope in upper case). The sequence of these 
linkers is shown in Fig. 9. in each case, the linker sequence has been cloned in 
frame with IL6R and has two unique cloning sites {Xhol and Not!) at its 5' end, 
allowing the introduction of cDNA libraries, or specific cloned sequences, in a 
directional manner. The FLAG epitope is recognised by a commercially available 
monoclonal antibody {M2; available from !BI/Kodak) regardless of its position within 
a fusion protein, and wil! thus allow the expression levels of surface protein to be 
measured directly by immunocytochemistry. 

4) , Vectors containing each of these linkers and an upstream signal sequence are 
tested for relative expression level and lL6R-funetion, as detailed beiow. 

To test the utility of these vectors for selecting proteins expressed at the cell surface; 
a number of known signal sequences are cloned into each vector. These are tested 
for surface, expression and SL6R function. Signal sequences include those from rat 
CD4 (a protein with extracellular !g domains), mouse sek {a receptor tyrosine kinase, 
with no extracellular !g domains) and mouse sonic hedgehog (a secreted factor). 
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ES cells are transfected with vectors bearing candidate signal sequences by 
lipofection or electroporation, followed by puromycin selection for transfected ceils. 
After overnight growth in the presence of L1F, to maintain the undifferentiated state 
and proliferation, transfected ceils are split into three groups and treated with either 
1) LIF, 2) 116 or 3) neither growth factor. Only cells bearing 1L6R brought to the cell 
surface by a fused signal peptide will proliferate in the presence of 1L6. Positive 
controls include ES cells transfected with wild-type IL6R grown in the absence of LIF 
and the presence of 1L6. Negative controls include empty vector (i.e truncated IL6R 
with no 5' insert) grown in the presence of IL6. To determine whether fusion proteins 
N-terminal to IL6R block signalling (by steric hindrance), the proportion of such ceils 
that express surface protein but fail to proliferate in response to IL6 is deduced by 
comparing the number of cells expressing the FLAG epitope with the number that 
give rise to colonies. 

Vectors defined by this assay are then used in cDNA library screens. Preferably, 
sequences corresponding to 5' ends of cDNAs are generated from full length cDNA 
libraries and directionaiiy cloned in the screening vector. 
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We have thus described the development of an optimised transfection and expression 
system which wilt enable high throughput functional screening of cDNAs in 
piuripotential mouse embryonic stem (ES) cells and differentiated derivatives. The 
strategy is based on extrachromosomaf vector replication driven by expression of 
polyoma large T protein. When a vector containing a polyoma origin of replication 
is introduced into an ES cell line that harbours polyoma large T antigen, a high 
frequency of stable secondary transfection results. This process is referred to as 
supertransfectton. Supertransfected plasmids can be maintained episomally during 
!ong4erm culture and during differentiation in vitro. Expression of a /?-galacfosidase 
reporter from an eptsoma! vector is both ubiquitous and stable, in contrast to the 
variegated and unstable expression usually observed after cDNA integration into the 
ES ceil genome. Moreover, in the absence of integration, promoter strength is 
predictable and a range of expression levels can reliably be achieved by using 
different elements. We also show that episomal vectors can be used for efficient 
expression of both cytosolic and secreted proteins. These features should make this 
system invaluable for functional analyses of defined cDNAs and for direct expression 
screening of cDNA pools or libraries in ES cells. 
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Table 1. Comparison of jff-galactosidase activities directed by various promoters in 
transient and stable supertransfectants. 



Promoter Relative 0-gal activity 

transient stable 

SV40e/p 1.0 1.0 

h/?Ap 1.1 0.7 

mPGKp 0.5 0.5 

TKp 0.1 0.1 

CAG 19.0 1.8 



5x10 6 MG1.19 ES celts were supertransfected with 20j/g of vector DNAs. After 3 
days culture for transient expression assay or 8 days selection with hygromycin B for 
stable expression assay, the jff-gaiactosidase activity generated by these constructs 
was measured by ONPG assay. Results are normalised relative to activity generated 
by the SV40e/p construct. See 'Materials and methods' for construction details of 
vectors. 



WO 98/32868 



- 36 ~ 



PCT/GB98/G0216 



Table 2. Supertransfection of LIF and 1L-2 expression vectors into MG1.19 
ES ceSis. 



Vector LIF in medium No. of hyg r stem celi colonies 

pHPCAG-/// + 42,000 

pHPCAG-//? - 38,000 

pHPCAG-Z/2 + 48,000 

pHPCAG-//2 - 0 

5x10 s MG1.19 ES cells were supertransfected with IQjjq of vector DNAs. After 8 
days selection with 80//g/ml of hygromycin B in the presence or absence of LIF, the 
number of stem ceil colonies were scored. 
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Table 3. Efficiency of supertransfection of vectors with various selection markers. 

Selection marker Drug for selection (/t/g/ml) No. of resistant colonies 

PGKnpnpA hygromycin B (80) 50,000 

SVbsrpA biasticidin S (4) 12,600 

hCMVzeopA zeocin (20) 20,600 

5x10 6 MG1.19 ES ceils were supertransfected with 20//g of vector DNAs of episoma! 
vectors, pBPCAG and pZPCAG, which carry bsr and zeo resistance genes 
respectively. After 8 days selection with the appropriate drug, the number of drug- 
resistant stem ceii colonies were scored. 
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Table 4. Effects of overexpression of transgenes in ES ceils using pHPCAG, 



cDNA 


Relative number of 
hygro R colonies 


Colony Size and 
Morphology 


None 


1.00 


Normal 


iacZ 


0.64 


small 


DIA/LIF 


0.87 


slightly small 


IL-2 


0.92 


slightly small 


Rex-1 


0.88 


Normal 


Fgf-2 


0.65 


Normal 


Fgf-4 


0.82 


Normal 


Fgf-5 


0.41 


Normal 


Oct-1 


0.17 


small 


Oct-2 


0.65 


slightly small 


Oct-3/4 


0.61 


differentiated 


Oct-6 


0.03 


some differentiation 


c-jun 


0.47 


small 


E1A 


0.08 


differentiated 


Jak2 K/E 


0.75 


Normal 


bcl-2 


0.28 


small, spindle morphology 


MAPKP 


1.38 


Normal 


RXRa 


0.20 


some differentiation 


RXR^ 


0.63 


Normal 


RXRy 


0.91 


Normal 


COUP-TF1 


0.40 


some differentiation 


HNF-4 


0.05 


Normal 


Statl 


0.10 


small 


StatS 


0.52 


Normal 


Stat4 


0.16 


Normal 


Stat3DON* 


0.14 


differentiated 



5x1 0 6 ES MG1 .19 cells were supertransfected with 20 u.g of expression vectors and selected 
with 80 u.g/ml of hygromycin B for 8 days. The numbers of drug-resistant colonies were 
counted and normalised relative to numbers obtained with empty vector. 
StatSDON is the dominant interfering mutant form of Stat3 described by Akira et at. (1996). 
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Claims 

1 . A method of expressing a DMA in a ceil, comprising: 

(a) (i) transfecting the cell with a first vector that expresses a 

replication factor; or 
(ii) otherwise obtaining a cell that expresses or will express the 
replication factor; 

and 

(b) transfecting the ceii with a second vector, wherein 

(i) the second vector contains a DNA, or is adapted to receive a 
DNA, in operative combination with a promoter for expression of 
the DNA; and 

(ii) extrachromosomal replication of the second vector is dependant 
upon presence within the cell of the replication factor. 

2, A method according to Claim 1 wherein the replication factor is a viral 
replication factor. 

3. A method according to claim 1 or 2 wherein the viral replication factor is 
selected from polyoma large T antigen, EBNA-1 antigen, papilloma virus 
replication factors, SV40 large T antigen and functional variants, analogues 
and derivatives thereof appropriate to the cell species. 

4, A method according to any of claims 1-3 wherein the second vector does not 
express the replication factor. 

5, A method according to any of claims 1-4 wherein the second vector expresses 
a selectable marker. 

6. A method according to any of claims 1-5 further comprising transfecting the 
ceil with a third vector, wherein the third vector contains a DNA, or is adapted 
to receive a DNA, in operative combination with a promoter for expression of 
the DNA, and replication of the third vector is dependant upon presence within 
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the ceil of the replication factor. 
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7. A method according to Claim 6 wherein the third vector expresses a selectable 
marker, which selectable marker is different to that expressed by the second 
vector. 

8. A method according to any preceding claim wherein the eel! is a mammalian 
cell or an avian ceil. 

9. A method according to any preceding claim wherein the cell is an embryonic 
cell. 

10. A method according to Claim 9 wherein the ceil is an ES, EC or EG cell. 

11. A method according to any preceding claim for transfection of an ES ceil 
wherein the ES cell of step (a) expresses polyoma large T antigen and the 
second vector comprises a natural target for polyoma large T antigen, such as 
On or functional variants thereof adapted to bind to polyoma large T antigen. 

12. A method according to any preceding cfaim wherein the DNA codes for a 
polypeptide or protein. 

13. A method according to any of Claims 1-11 wherein the DNA codes for an 
antisense RNA. 

1 4. A method according to any preceding claims wherein the promoter is inducible, 

15. A method according to any preceding claim wherein transcription of the DNA 
can be activated by a site specific recombinase. 

16. A method according to any preceding claim wherein replication of the second 
vector can be prevented by a site specific recombinase. 
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1 7. A vector for transfection of a cell, wherein: 

(i) the vector contains a DNA, or is adapted to receive a DNA, in operative 
combination with a promoter for expression of the DNA; 

(ii) extrachramosomal replication of the vector is dependant upon presence 
within the cei! of a replication factor; and 

(ili) the vector does not express the replication factor. 

18. A vector according to Claim 17 wherein the replication factor is a viral 
replication factor. 

19. A vector according to Claim 17 or 18 wherein the viral replication factor is 
selected from polyoma large T antigen, EBNA-1 antigen, papilloma virus 
replication factors, SV40 large T antigen and functional variants, analogues 
and derivatives thereof. 

20. A vector according to any of Claims 17 to 19 wherein the vector is 
substantiaiiy free of DNA coding for the replication factor or any part thereof. 

21 . A vector according to any of Claims 17 to 20 for transfection of mammalian or 
avian cells. 

22. A vector according to any of Claims 17 to 21 for transfection of ES cells. 

23. A vector according to Claim 22 comprising a natural target for polyoma large 
T antigen, such as Ori or functional variants thereof adapted to bind to 
polyoma large T antigen. 

24. A vector according to any of Claims 17-23 wherein the DNA codes for a 
polypeptide or protein. 



25. A vector according to any of Claims 17-23 wherein the DNA codes for an 
antisense DNA. 
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26. A vector according io any of Claims 17-25 wherein the promoter is inducible. 



27. A vector according to any of Claims 17 to 26 wherein the vector comprises a 
sequence coding for a selectable marker. 

28. Use of a vector according to any of Claims 17-27 for expression of a DNA 
sequence within a ceil. 

29. A cell transfected with a first vector that expresses a replication factor and with 
a second vector according to any of Claims 17 to 27. 

30. A mammalian cell according to Claim 29. 

31. An embryonic ceil according to Claim 29. 

32. A cell selected from an ES, EC or EG cell according to any of Claims 29 to 31 , 
and differentiated progeny thereof. 

33. An assay for the effect of presence in a ceil of a protein or polypeptide or other 
product of DNA expression, comprising the steps: 

(a) (i) transfecting the ceil with a first vector that expresses a 

replication factor; or 
(ii) otherwise obtaining a eel! that expresses or will express the 
replication factor; 

(b) transfecting the cell with a second vector, wherein 

(i) the second vector contains a DNA coding for the protein or 
polypeptide or other product of DNA expression in operative 
combination with a promoter for expression of the DNA; 

(ii) the second vector also contains a DNA coding for a selectable 
marker in operative combination with a promoter for expression 
of the selectable marker; and 

(iti) extrachromosomal replication of the second vector is dependant 
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upon presence within the ceil of the replication factor; 
{c) selecting for ceils that have been transfecied with the second vector; 
and 

(d) maintaining the selected cells over a plurality of generations so as to 
assay the effect of expression of the protein or polypeptide or other 
product of DNA expression. 

34. An assay according to Claim 33 wherein step (a) is carried out once and the 
cells obtained are divided and used for a plurality of separate assays in which 
steps (b)-{d) are carried out a plurality of times with second vectors containing 
different DNA sequences. 

35. An assay according to Claim 33 or 34 for assay of the effect of presence in the 
cell of two factors, each factor being independently selected from a protein, a 
polypeptide and another product of DNA expression. 

36. A method of screening a library of cDNAs comprising assaying the effect of 
expression of each of the cDNAs according to the method of any of Claims 33 
to 35. 

37. A method of investigating the properties of a DNA sequence comprising 
expressing in a cell a composite DNA including (a) the DNA sequence under 
investigation, linked to (b) a DNA coding for a cell active protein, wherein 

activity of the cell active protein is dependant upon transport of the cell active 
protein to the ceil surface, and 

the DNA of (b) does not code for a polypeptide capable of directing 
transportation of the cell active protein to the eel! surface. 



38. 



A method according to Claim 37 for screening a library of DNAs to Identify 
DNA sequences coding for signal polypeptide sequences that transport 
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proteins to the ceil surface, and the method optionally comprises determining 
whether the celi active protein is transported to the celt surface and remains 
there or is secreted by the ceil. 

39. A method according to Ciaim 37 or 38 wherein the DNA of (b) is obtained by 
deleting or disabling, from a DNA encoding a ceii surface or secreted protein, 
that portion of the DNA that codes for the polypeptide sequence responsible 
for transportation of the protein to the ceil surface. 

40. A method according to any of Claims 37 to 39 wherein the cell active protein 
induces a morphological or proliferative change in the cell. 

41 . A method according to any of Claims 37 to 40 wherein the ceil active protein 
inhibits differentiation of the cell and in the absence of the cell active protein 
the celi wilt differentiate. 

42. A method according to any of Claims 37 to 41 wherein the cell active protein 
is a cell surface receptor. 

43. A method according to Ciaim 42 wherein the eel! active protein is an IL-6 
receptor and the DNA of (b) encodes a modified form of the receptor 
preprotein lacking a functional signal sequence. 

44. A method according to any of Claims 37 to 43 comprising investigating the 
properties of a DNA in mammalian or avian cells. 

45. A method according to any of Claims 37 to 44 comprising investigating the 
properties of a DNA in embryonic cells. 

46. A method according to Claim 45 comprising investigating the properties of a 
DNA in ES, EC or EG cells or differentiated progeny of such ceils. 
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47. A method according to any of Claims 37 to 46 comprising expressing the 
composite DNA by; 

{a} (i) transfecting the ceil with a first vector that expresses a 
replication factor; or 
(ii) otherwise obtaining a eel! that expresses or wili express the 
repiication factor; 

(b) transfecting the cell with a second vector, wherein 

(i) the second vector contains the composite DNA in operative 
combination with a promoter for expression of the composite 
DNA; 

(ii) the second vector also contains a DNA coding for a selectable 
marker in operative combination with a promoter for expression 
of the selectable marker; and 

(iii) extrachromosomal replication of the second vector is dependant 
upon presence within the celi of the replication factor; 

(c) selecting for cetis that have been transfected with the second vector; 
and 

(d) maintaining the selected cells over a plurality of generations so as to 
assay the effect of expression of the composite DNA. 

48. A method according to claim 47 wherein step (a) is carried out once and the 
cells obtained are divided and used for a plurality of separate methods in 
which steps (b)-(d) are carried out a plurality of times with second vectors 
containing different DNA sequences. 

49. A method according to any of Claims 37 to 48 for identification of a DNA 
coding for a cefl surface or secreted protein. 



50. A method according to any of Claims 37 to 48 for identification of a cell 
surface or secreted protein. 
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FIG. 9 

SEQUENCE OF LINKER 01 JfiONUCLEOTIDES. 



a) FLAG linker 

FLAG epitope 
Xhol Not I Gly Ser ASP TYR LYS ASP ASP 

ASP CTA GA G TCG AGT AGC GGC CGC GGC AGC GAC TAG AAG GAG 
GAC GAC 

BssHII 

ASP LYS Gly Ser Cys Arg Ala 
GAC AAG GGG AGC TGC CGC GCG C 



b) [gly 4 ser] 2 linker 

Xhol NotI Gly Gly Gly Gly Ser Gly Gly 

Gly Gly Ser 

CTA GA C TCG AGT AGC GGC CGC GGA GGC GGA TAC GGA AGC GGA 
GGA GGG AGC 

BssHII 
Cys Arg Ala 

TGC CGC GCG C 
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FIG. 10 

SEQUENCE OF TRUNCATED AND MODIFIED 1L6R. 



a) FLAGdeltaIL6R 

TCTAGACTCGAGTAGCGGCCGCGGCAGCGACTACA^^ 
GGCAAATGGCACAGTGACAAGCCTGCCAGGGGCCACrc^ 

GGACGTGCAGCTCAGCGACACTGGGGACTATrTATGCTCC^ 

GGTGGATGTTC CCCCAGAGGAGC CCAAGCTCTCCTGCTTCCGG AAGAAC CCCCTTGTCAACGC CAT CTGTGAGTG 

GAGTGACTTCCAGGTGCCCTGCCAGTArTCTCAGQGCT^^ 

TGACAAAGTATACCACATAGTGTCACTGTGCGTTGCAAA^ 

TCACAGCTTAAAAATGGTGCAGCCGGATCCAC^ 

GCTCAAAGTCAGCTGGCAGCACCCT'GAGACCTGGGACCCGAGT^ACTACTTGCTGCAGTTCCAGCTTCGATACCG 
ACCTGTATGGTCAAAGGAGTTCACSGTGT?GCTGCTCCCGGT0K3^ 

GCGAGGAGTGAAGCACGTGGTCCAGGTCCGTGGGAAGGAGGAGCTTGACCTTGGCCAGTGGAOTGAATGGTCCCC 
AGAGGTCACS^CACTCCTTGGATAGC^GCCCAG^ 

CTCTGTTGAAGACTCTGCCAACCACGAGGATCAGTACGAAAGTTCTACAGAAGCAACGAGTGTCCTCGCCCCAGT 

GCAAGAATCCTCGTCCATGTCCTTGCCCACATTCCTGI^^ 

TGTCTTCATC^TCCTGAGACrCAAGCAGAA^TGGAAGTCAGAG^ 

\CCCCACACAGCTCTGG 

GTCTGACAATACCGTAAACCi 
CTACTTATTCCCCAGATAA 

b) Egly<ser]jdeltaIL6R 



TGGCACAGTGACAAGCCTGCCAGGGGCCACCGTTACCCTGATTTC 

CATCCACTGGGTGTACTCTGGCTCACAAAACAGAGAA 

GCAGCTCAGCGACACTGGGGACTATTTATGCTCCCTGAAT^ 

TOOTCCCCCAGAGGAGCC£^GCTCTCCTGCTTCCG^ 

GAGCAGCACCCCCTCTCCAACCACGAAGGCTGTGCTGTrTC 

CTTCCAGGTGCCCTGCCAGTATTCTraGCAGC^^ 

AGTATACmCATAGTGTCACTGTGCGTTCGAA^ 

cttaraaatggtgcsgccggatccacx:tgcc^ 

AGTCAGCTGGC^GCACCCTGAGACCTGGGACCCGAGTTACTACTTGCTGCAGTTCCAGCTTCGATAC 

AraSTCAAAGGAGTTCACGGTGTTGCTGCTCCC^TGGCCCAGTACCAATGCSTCATCCATGATGCC^ 

AGTGAAGCACGTGGTCCAGGTCCGTGGGAAGGAGGAGCTTGACCTi^CAGTGGAGT^ 

CACGGG^CTCCTTGGATAGCACUVGCCCAGGACCA^^ 

TGAAGACTCTGCC^CCACGAGGATCAGTACGAAAGr^ 

ATCCTCOTCCLAreTCCCTGCCCACATTCCT^ 

CATCATCCTGAGACTCAAGCAGAAATGGAAGTCAGAGGCTGAGRAGGAAAGCAAGACGACCTCTCCTCCACCCCC 
ACCGTATTCCrrGGGCCCACTGAAOXGA^ 

CAATACCGT AARCCACAG CTG CCTGGGTGTCAGGGACGCACAGAG CCCTTATGACAACAGGAACAG AGACTACTT 
ATTCCCCAGATAA 
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DNA Expression In Transfected Cells And 
Assays Carried Out in Transfected Cells 

This invention reiates to methods of expressing DNA in cells, to vectors for 
expression of DNA in cells and to transfected cells. The invention also relates to 
assays carried out in transfected ceils or differentiated derivatives of such cells. In 
particular the invention relates to transfection of and expression of DNA in embryonic 
stem (ES) ceils. 

The weaiih of sequence information now becoming available from the genome 
projects demands the development of new, high throughput systems for functional 
analysis, A powerful route to discovering and characterising genes involved in 
determination and differentiation in mammals is potentially available via the genetic 
manipulation of ES ceils in vitro. 

ES celts, which are derived from the piuripotentiai inner celi mass (ICM) of the 
preim plantation mouse embryo (2,3), retain the capacity for multiiineage differentiation 
both in vitro (4,5) and in vivo (6,7). in principle, therefore, gene products which 
influence developmental decisions should be assayable in ES ceil culture systems, 
whatever the source of the cells. However, there are major difficulties in analysing 
cDNA function by ES eel! transfection. The frequency of isolating stable transfectants 
is low (<10' 4 by eiectroporation, calcium phosphate co-precipitation or iipofection) and 
the great majority of transfectants show heterogeneous and unstable expression. 

These problems are particularly significant in the case of cDNAs whose expression 
causes differentiation because differentiated ES cell progeny do not generally 
proliferate. In such cases transfectants may still be isolated but transgene expression 
will be minimal. 

Episomai vectors have been used for functional screening in other cell types in order 
to increase the frequency of stable transfection and to achieve reliable transgene 
expression. However, previously described episomai vectors, for example based on 
Epstein-Barr virus (EBV) or bovine papilloma virus (BPV), have limitations both in 
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host cell range and maintenance during long-term culture. 
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A modified extrachromosomal vector is known based on the replication system of 
murine polyoma virus (8), This plasmid, pMGD20neo, can be stably maintained as 
an episome in ES cells during long term culture, Importantly, the low levels of large 
T protein produced have no overt effect on the growth or differentiation properties of 
the ES ceils (8,9). It is also known to maintain simultaneously with pMGD20neo a 
second episomai vector. Expression from the second vector was not possible hence 
pMGD20neo was used for cDNA expression. However, this vector already comprises 
two expression cassettes, one each for large T antigen and the neo selectable marker 
so its size constrains its use for expression of a third cassette containing a cDNA. 

It is an object of the invention to provide a vector for transfection of and expression 
of DNA within a cell and a method of expressing DNA in a celi that overcomes or at 
least ameliorates the disadvantages identified in the art. An object of at least the 
preferred embodiments of the invention is to achieve, in a transfected cell, expression 
that is more stable and more homogenous than hitherto attainable. Further objects 
of preferred embodiments of the invention are to provide a method of expressing a 
DNA in an embryonic cell in a more stable and more homogenous manner than 
hitherto attainable, and to provide for stable transfection of embryonic cells at a 
higher frequency than can be obtained using conventional vectors. 

The invention Is based upon the maintenance of a vector within a cell, wherein 
maintenance of the vector is dependant upon the continued presence within the eel! 
of a certain factor and wherein that factor is not expressed by the vector but is 
produced in or present in the cell in an amount sufficient to maintain the vector. 

Accordingly the invention provides a transfection and expression method comprising, 
in a cell that expresses or will express a replication factor, introducing a vector 
dependant upon that replication factor. Thus, in a first aspect, the invention provides 
a method of expressing a DNA in a cell, comprising: 

(a) (i) transfecting the cell with a first vector that expresses a 
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replication factor; or 
(ii) otherwise obtaining a celi that expresses or will express the 
replication factor; 

and 

(b) transfecting the ceii with a second vector, wherein 

(i) the second vector contains a DNA, or is adapted to receive a 
DNA, in operative combination with a promoter for expression of 
the DNA; and 

(ii) extrachromosomai replication of the second vector is dependant 
upon presence within the ceii of the replication factor. 

The replication factor is optionaity non-toxic to the celi. Alternatively, the replication 
factor is toxic to the cell at high levels of expression but at iow ieveis of expression 
is substantially non-toxic to the ceii but at these low levels is present in sufficient 
amount to enable replication of the second vector. 

Further, the replication factor preferably does not alter the ability of the celi to 
differentiate or proliferate, and may thus be regarded as being neutral to the celi 
phenotype. This enables the activities of the product of a cDNA to be investigated 
over a iong time period and many ceil generations without having to take account of 
possible interfering effects of the replication factor present within the cell. Again, the 
replication factor may be phenotype-neutral at ail ieveis or may be neutral at a low 
level which is nevertheless a sufficient level to maintain the second vector within the 
ceil. 

The invention is of application to ali celi types for which there exists, whether from a 
natural or synthetic source, a replication factor capable of maintaining in that ceii type 
an episomal vector. The vector is preferably stably maintained, meaning it is 
maintained over a number of ceil generations, and at least over 3 generations. The 
ceil is preferably selected from the group consisting of mammalian cells, in particular 
primate ceils or murine cells, and avian celis. it is further preferred that the cell is an 
embryonic cell, in particular an ES, EC (embryonic carcinoma) or EG (embryonic 
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gonadal} ceil, or differentiated progeny of any such cell. 

While reference is made to the second vector, it will be appreciated that the 
replication factor is optionally present in the cell other than following transfection with 
a first vector. For example a culture of ceils that already express the replication factor 
may be obtainable from a third party. 

in an embodiment of the invention described in detail below, the method comprises 
transfecting an ES cell with a first vector that expresses a viral replication factor, and 
thereafter transfecting the ES cell with a second vector that expresses a cDNA and 
is dependant upon presence of the virai replication factor for its extrachromosoma! 
replication within the ES cell. The frequency of the first transfection step is generally 
low and may result in as few as 1 in 10 5 successful stable transfectants - this level 
of success is recognised as typical in this art. However, the second transfection has 
surprisingly and advantageously found to result in a significantly higher frequency of 
successful stable transfectant colonies being obtained. The second transfection can 
be carried out with a 1% or higher success rate, which represents a 100-fold 
improvement over the art. 

One suitable viral replication factor for mouse cells, in particular mouse ES cells, is 
polyoma iarge T antigen, in which case the cell of step (a) expresses the polyoma 
large T antigen and the second vector comprises an origin of replication that binds 
the polyoma large T antigen, such as the polyoma replication origin, referred to as 
Or/. Another suitable viral replication factor for primate cells is based upon Epstein 
Barr virus, in which the primate cell of step (a) expresses the EBNA-1 antigen and 
the second vector comprises an origin of replication that binds EBNA-1 , such as OriP. 
Viral replication factors are generally species - specific and so expression of DNA 
according to the invention is dependent upon choice of a replication factor appropriate 
to the celi. Polyoma large T has been described for use in mouse ceils. E/7NA-1 is 
suitable for human cells. Still further systems are optionally based on papilloma virus 
replication factors, for human cells, or SV40 virus large T antigen, for simian cells, 
and further suitable replication factors may also be selected from functional variants, 
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derivatives and analogues of these replication factors, such as temperature sensitive 
variants. 

In use, the second vector is constructed according to standard techniques so as to 
contain a cDNA sequence or insert of interest operatively combined with a promoter 
to express the cDNA. The second vector is used to transfect an ES cell already 
expressing a replication factor and successful transfectants are recovered in which 
it is found that the second vector is stably maintained within the ES cell and 
expresses the cDNA with a more homogenous pattern than when prior art techniques 
are followed. Thus, the invention provides an advantageous method for expression 
of a cDNA in a celt. 

In this context, "homogenous" in relation to expression of a cDNA in a colony of 
transfected ES cells is used to indicate that most cells, or a large proportion of cells, 
or preferably most cells, or more preferably substantially ail cells, express the cDNA 
and "stable" is used to indicate that the cells continue to express the cDNA at a 
similar level and preferably at substantially the same level. In the examples carried 
out to date and described below, homogenous transfection is seen with the method 
of the invention to a greater extent than in the art methods. Also, in the examples 
carried out to date and described below the method results in more stable expression, 
meaning that expression does not alter over time. This has the advantage that study 
of the long term effects of a cDNA product is facilitated. 

It is optional for the ceil of step (a) first to be obtained or prepared by transfection of 
a ceil by a first vector and for this then to be used for the starting cells for carrying 
out a plurality of separate transfections by second vectors containing different DNA 
inserts coding for different DNA products of interest. Following this procedure, the first 
transfection may be carried out with the level of success typically seen in 
conventional techniques and the ES cells obtained divided into separate colonies. 
The second transfections, introducing the DNA insert in the second vector, are then 
carried out with the higher levels of success typically seen in the methods of the 
invention. 
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In the case that the method comprises transfection with first and second vectors, it 
is preferable for the first vector to code for a selectable marker and for the second 
vector also to code for a selectable marker, though a different one. In a specific 
embodiment of the invention described below, the first vector codes for hygromycin 
resistance and the second codes for neomycin resistance. This allows selection of 
ES celis in which transfection by both first and second vectors has been successful. 

It is a further embodiment of the invention for the method to comprise an additional 
transfection step with a third vector, wherein the third vector contains a cDNA, or is 
adapted to receive a cDNA, in operative combination with a promoter for expression 
of the cDNA, and extrachromosomal replication of the third vector is dependant upon 
presence within the ES ceil of the replication factor. Transfection with the third vector 
is optionally at the same time as transfection with the second vector or subsequent 
thereto. 

The second and third vectors preferably each comprise a selectable marker enabling 
selection of ES ceils in which transfection has been successful. The respective 
selectable markers are preferably different if the method comprises transfection with 
both second and third vectors, and preferably different again from the selectable 
marker of the first vector. 

It is a feature of particular embodiments of the invention that the second vector (and 
third or subsequent vectors if present) are not able to express the replication factor, 
in fact, in construction of the second vector from a vector comprising DNA encoding 
the replication factor it is preferable for that DNA to be largely or substantially 
completely deleted. 

in a specific embodiment of the invention, the first vector is pMDG20neo and 
expresses polyoma large T antigen and the second vector comprises the natural 
target for polyoma large T antigen, namely Ori, expresses a cDNA of interest but 
does not express large T antigen. In use, the large T antigen is expressed by the 
first vector and binds to Or! of the second vector when it enters an ES cell, thus 
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enabling replication of the second vector and its maintenance within the ES cell in an 
extrachromosomal state, in successful transfectants, the vector remains 
extrachromosomal, and this is believed to render the vector relatively immune from 
effects seen when a vector is integrated into the host ES cell genome, which effect 
may include silencing of the cDNA resulting in unstable and heterogeneous 
expression. 

An alternative to use of the first episomai vector is to introduce into the cell a 
construct that expresses the replication factor and integrates with the celt genome. 
The construct should therefore include a DNA sequence coding for the replication 
factor and means for selection of cells in which the construct has successfully 
integrated; one example is a construct that comprises cDNA coding for, in order, large 
T antigen - an internal ribosome entry site (IRES) - Bgeo. A culture of cells is then 
obtained by selecting for cells that express the selectable marker, such as in this 
case by selection in G418. Staining with Xgal is used to identify transfectant clones 
which show stable and homogenous expression. The construct preferably comprises 
a promoter that gives stable, low level expression in transfected cells, such as the 
HMGCoA promoter for ES cells. The cells obtained can then be subjected to 
transfeciion with the second and optionally third and subsequent vectors. 

in another embodiment of the invention the second vector comprises an inducible 
promoter. Some types of differentiated cells, derived from ES cells, can only be 
obtained with any reliability if a particular differentiating factor is expressed after a 
prior event. One example is neurone formation which generally only occurs after 
aggregation of cells. Thus, using an inducible promoter, expression of DNA that 
codes for the factor that leads to neurone formation can be controlled until the ES 
cells have suitably aggregated. Interferon responsive promoters are some examples 
of inducible promoters. Alternatively, the cDNA is designed to be in a non-functional 
form and to be capable of being modified into a functional form at a later time. One 
possibility is for the cDNA to be disrupted for example by termination sequences 
which are flanked by target sites for a site specific recombinase, such as loxP sites, 
removable by Cre recombinase, or frt sites removable by Flp recombinase. Cre and 
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Ftp can be fused to steroid hormone receptors in order to make their activity 
regulatable. After administration of steroid the Cre or Fip recombinase will translocate 
to the nucleus and there convert the cDNA into a functional form by excision of the 
disrupting sequence. It may also be desired to stop or inhibit or reduce replication 
of the second vector; the method optionally comprises using a site specific 
recombinase to present replication of the second vector. This can be achieved by 
deletion of a sequence in the vector to which the replication factor must bind in order 
for the vector to be replicated by the host cell. 

The term DNA or cDNA is usually understood to refer to a DNA sequence that is 
transcribed into a mRNA that is translated into a polypeptide or protein, in the present 
invention the term is also intended to encompass any product of DNA expression. 
It thus includes DNA coding for an antisense RNA, or for an antisense ribozyme 
molecule. 

The method of the invention is suitable for assaying effects of DNA expression, due 
to the stability and efficiency of expression achievable. Accordingly, the invention 
further relates to an assay for the effect of presence in a cell of any product of DNA 
expression - such as protein, polypeptide, antisense RNA, ribozyme RNA, transfer 
RNA or other. The method comprises steps (a) and (b) as described above wherein 
the second vector also contains a DNA coding for a selectable marker. The method 
further comprises selecting for cells that have been transfected with the second 
vector and maintaining the selected cells over a plurality of generations. 

Step (a) may be carried out once and then steps (b) onwards repeated for different 
assays, and the method is of particular application to screening a cDNA library. 
Furthermore, two or more cDNAs can be expressed in the same cell to assay the 
effect of the combination of .their respective expression products. 

The invention also relates to a vector. Accordingly, the invention provides, in a 
second aspect, a vector for transfection of an ES cell, wherein: 

(i) the vector contains a DNA, or is adapted to receive a DNA, in operative 
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combination with a promoter for expression of the DNA; 
(is) extrachromosomal replication of the vector is dependant upon presence 

within the ES cell of a replication factor; and 
(tit) the vector does not express the replication factor. 

The vector is characterized in preferred embodiments as described above in relation 
to the second vector of the first aspect of the invention. 

it is an advantage of at least preferred embodiments of the invention that due to very 
high efficiency of stable secondary transfection (supertransfection) of cells, for 
example transfection of ES cells harbouring pMGD20neo with a second plasmid 
containing the polyoma replication origin (Ori) (8), that expression of DNA is stably 
and efficiently achieved from the second plasmid. 

Another aspect of the present invention provides a method of screening for new 
DNAs that encode signal sequences and proteins that are transported to the cell 
surface. The invention according provides a method of investigating the properties 
of a DNA sequence comprising expressing in a cell a composite DNA including (a) 
the DNA sequence under investigation, linked to (b) a DNA coding for a cell active 
protein, wherein 

activity of the cell active protein is dependant upon transport of the ceil active 
protein to the cell surface, and 

the DNA of (b) does not code for a polypeptide capable of directing 
transportation of the cell active protein to the cell surface. 

This offers the advantage that where the DNA of interest does indeed code for a 
sequence that transports a polypeptide to the cell surface, whether that polypeptide 
remains there or is ultimately secreted, this will be apparent from observation that the 
eel! active protein has had or is having its known effect. Thus the method offers a 
convenient means of identifying DNA sequences that will transport proteins to the ceil 
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surface. 

The method is suitably used for screening a library of DMAs to identify DNA 
sequences coding for signal polypeptide sequences that transport proteins to the cell 
surface. The cell active protein if transported to the ceil surface may remain there 
or be secreted by the cell, and this distinction may be separately assayed, or 
example by examination of the make-up of the culture medium before and after the 
investigation. 

One convenient way to obtain the DMA of (b) is by deleting or disabiing, from a DNA 
encoding a cell surface or secreted protein, that portion of the DNA that codes for the 
polypeptide sequence responsible for transportation of the protein to the cell surface. 
The cell active protein is optionaiiy a cell surface receptor and the DNA of (b) can 
thus encode a modified form of the receptor preprotein lacking a functional signal 
sequence. In a specific embodiment described below the IL-6 receptor is used as 
expression of the receptor in ES cells can be used to inhibit differentiation of the celis 
- a readily observable property of the cell active protein. Gross morphological or 
proliferative changes induced in the cell by the celi active protein are of course readily 
observed, though the invention is of application to any celi active protein whose 
activity, when it is transported to the cell surface and / or secreted, can be assayed. 

A specific embodiment of this aspect of the invention comprises expressing the 
composite DNA by: 

(a) (i) transfecting a cell with a first vector that expresses a replication 

factor; or 

(ii) otherwise obtaining a cell that expresses the replication factor; 

(b) transfecting the ceil with a second vector, wherein 

(i) the second vector contains the composite DNA in operative 
combination with a promoter for expression of the composite 
DNA; 

(ii) the second vector also contains a DNA coding for a selectable 
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marker in operative combination with a promoter for expression 
of the selectable marker; and 
(iit) extrachromosomai replication of the second vector is dependant 
upon presence within the cell of the replication factor; 

(c) selecting for cells that have been transfected with the second vector; 
and 

(d) maintaining the selected cells over a plurality of generations so as to 
assay the effect of expression of the composite DNA. 

If many investigations are to be carried out it is preferred that step (a) is carried out 
once and the cells obtained are divided and used for a plurality of separate methods 
in which steps (b)-(d) are carried out a plurality of times with second vectors 
containing different DNA sequences. This offers the advantage that typically the first 
transfection step is of Sower efficiency than the second, so the method avoids having 
to repeat the low efficiency step too often. 

It is particularly preferred that the method is used for identification of a DNA coding 
for a ceil surface or secreted protein, and using the method to screen a library of 
DNAs provides a means of carrying out the screen for discovery of such DNAs and 
investigation of their properties. More especially, the method is for discovery of 
hitherto unknown or uncharacterized cell surface or secreted proteins, or for location 
of the coding sequence of known proteins of this type. 

This aspect of the invention optionally further incorporates in preferred embodiments 
features of transfection of ceils described above in relation to other aspects of the 
present invention. 

The invention enables development of a series of vectors which give highly efficient 
and robust expression of transgenes in cells. Cloned cDNAs of interest can rapidly 
be characterised using this system, it is also applicable to the discovery of novel 
regulatory molecules through functional expression screening of cDNA libraries. 
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Due to their pluripotent and proliferative character, key cellular processes such as 
viability, propagation, determination and differentiation, can be analyzed in transfected 
ES cells. The "supertransfection" system of the invention overcomes the limitations 
associated with conventional cDNA transfection and opens a powerful new route to 
gene discovery and characterisation in mammals. 

Key features of the episomal supertransfection system, described according to the 
examples below, are that very high efficiencies of stable transfection are obtained and 
that cDNA expression is homogeneous, stable and reliably dictated by promoter 
strength. The increased efficiency of isolating stable transfectants is significant 
because it allows reliable detection of cDNAs whose expression results in cell death 
or differentiation, in addition a high transfection efficiency is generally advantageous 
for any high throughput assay system and is essential for functional cDNA library 
screening. The reliability of cDNA expression is critical for functional studies and the 
robust nature of expression from episomal vectors contrasts favourably with the 
variable and unstable expression observed in conventional ES cell transfectants. 

Heterogeneous expression of integrated transgenes is not an artefact arising from use 
of bacterial lacZ as a reporter gene, firstly because similar observations have been 
made using mammalian thy-1 as a reporter in F9 cells, and secondly because 
ubiquitous expression of lacZ can readily be obtained following gene trap integrations 
(23,24). The expression pattern throughout the population cannot be determined by 
Northern bfot but can only be revealed by in situ hybridization or use of a linked 
reporter gene such as IRES-/acZ (25) Heterogeneous expression, which previously 
occurred in the great majority of transfected clones following stable integration, gave 
unclear or misleading results on the phenotypic consequences of transgene 
expression. 

The difference in expression pattern between conventional transfectants and episomal 
supertransfeciants of the invention arises because an extrachromosomat copy of a 
transgene is not subject to alteration during the integration process nor to modification 
arising from the genomic sequences flanking an integration site. The so-called 
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"position effect" can modify both the level and pattern of transgene expression in 
stable transfectants. Furthermore, the expression of integrated transgenes is often 
suppressed over several generations in ES ceil cultures. This silencing phenomenon 
contributes to the high backgrounds which can be obtained in double replacement 
type targeting strategies (28) , it has been observed in stable transfectants with 
different transgenes driven by viral promoters or minimal mammalian promoters such 
as the widely used human /?-actin and mouse PGK-1 promoter elements. One 
hypothesis to explain this phenomenon is that transgenes may become targets of cfe 
novo methyltransferase in stem ceils (27). Macleod et af. (28) reported that a 
methyiation free locus couid be generated in transgenic mice by introduction of the 
whole CpG island of the aprt promoter. 

Whatever the molecular mechanism of silencing, it appears not to occur to episomaiiy 
maintained transgenes in vectors of the invention. In addition, the level of expression 
obtained from vectors of the invention is reliably dictated by promoter strength and 
can predictably be varied over at least a 10-fold range by appropriate choice of 
promoter. Eptsomai constructs of the invention thus offer considerable advantages 
for functional expression studies in ES cells. 

Functional cDNA expression cloning is a powerful method for direct isolation of 
important genes. The expression screening approach has often been employed to 
isolate cDNAs encoding surface and secreted molecules via transient expression, for 
example in COS ceils, in a few cases EBV-based systems have also been applied 
to isolate intracellular regulatory genes via stable expression in the target celis (29- 
32) . The high efficiency of supertransfection in the poiyoma system of the invention 
indicates that this approach could be applied to functional cloning in ES cells. Based 
on a transfectton efficiency of 2.5%, a library of 5x1 0 5 cDNA clones could be 
screened by electroporation of 2x1 Q 7 cells with 1QG//g DNA. For an effective library 
screen, the majority of transfectants should oniy take up a single piasmid. It is aiso 
advantageous if the cDNAs can readiiy be recovered in unrearranged form. Both of 
these conditions are satisfied by the episomal supertransfection system. By 
screening libraries prepared from undifferentiated ES cells it may be possible to 
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isolate cDNAs whose products mediate self-renewai. In this case direct selection can 
be applied for colony formation in the absence of LIF. For cDNAs whose products 
direct differentiation, however, it will be necessary either to screen pools through 
several rounds or to incorporate an inducible promoter into the episome. 

Recently, several improved protocols for in vitro differentiation of ES cells have been 
reported, which promote efficient generation of, for example, haematopoietic cells (33) 
, neurons (34) or cardiomyocytes (35). The episomal expression strategy of the 
invention can be applied for gain-of-function assays and screens during these 
differentiation programmes. It can also be used for loss-of-function analyses via 
overexpression of anti-sense RNA or dominant-negative mutants. Combination of 
these differentiation systems with the episorna! expression system will therefore 
provide powerful tools for analysing cell determination and differentiation events. 

The invention is now described with reference to the accompanying drawings in 
which: 

Fig. 1 shows the structure of the episomal expression vector pHPCAG; 
Fig. 2 shows supertransfection efficiency of pHPCAG in MG1.19 ES cells; 
Fig. 3 shows DNA hybridisation analysis of Hirt supernatants from 
su pertran sfectants; 

Fig. 4 shows the effect of vector size on supertransfection efficiency; 
Fig. 5 shows expression of #-ga!actosidase in MG1.19 transfectants; 
Fig. 6 shows the restriction pattern of plasmid DNAs recovered from 
pHPCAG-/acZ supertransfectant clone; 

Fig. 7 shows induction of differentiation by expression of STAT3F in MG 
1.19 ES cells; 

Fig. 8 shows co-supertransfection of STAT3F with wild type STAT 
expression vectors; 

Fig. 9 shows linker sequences for use in an assay of the invention; 

Fig. 10 shows DNA sequences coding for truncated and modified IL6R; and 

Fig. 1 1 shows a vector for use in an assay of the invention. 
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In more detail: 

Figure 1 shows the structure of the episomal expression vector pHPCAG. cDNAs can 
be introduced between two BstXl sites using BstX\ adaptors. Abbreviations: ALT20: 
deleted polyoma large T expression cassette LT20; Pyori/enh: mouse polyoma virus 
replication origin and mouse polyoma mutant enhancer derived from F101 strain; 
SVpA: SV40 polyA addition signal; PGKhpnpA: hygromycin B phosphotransferase 
gene expression cassette with mouse phosphogiycerokinase-1 (PGK) promoter and 
polyA addition signal; GAG: combined CAG expression unit; /T-gtobinpA: rabbit 0- 
globin polyA addition signal; SVori: SV40 replication origin; ColElori: CoiE1 
replication origin; amp: Eco// /lactamase gene conferring resistance to ampiciilin. 

Figure 2 shows supertransfection efficiency of pHPCAG in MG1.19 ES cells. 

(A) shows numbers of transfectant colonies per microgram of pHPCAG DNA. 5x1 0 6 
MG1.19 ES cells were supertransfected with the indicated amounts of supercoiled 
pHPCAG followed by selection with hygromycin B for 8 days. The resulting number 
of drug-resistant colonies were scored and efficiency per jjg DNA calculated, 

(B) shows total numbers of transfectant colonies plotted against total amount of 
plasmid DNA. 

Figure 3 shows DNA hybridisation analysis of Hirt supernatants from 
supertransfectants. Hirt supernatants were prepared from 5x1 0 6 parental MG1.19 
ceils and pooled pHPCAG supertransfectants. 1/20 of each sample was digested 
with either Eco Ri or tf/ndlll and analyzed by filter hybridisation using a 344bp Sea 
!~Sspl fragment from pUC19 which is common to both pMGD20neo and pHPCAG. 

Figure 4 shows the effect of vector size on supertransfection efficiency. 20//g of each 
of the supercoiled vectors pLT20AWdelhp/? (4.7), pLT20ABs£X!/?p/> (5.5), 
pLT20AA/w/Vlhpf? (5.6), pLT20&Sac\hph (5.9), ptkp (6.2), pSV40e/p (6.4), 
PGKhphALT20 (6.5), pmPGKp (6.6), phBAp (6.6), pHPCAG (7.7), ptkp-/acZ (8.9), 
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pSV40e/p-/acZ(9.1), pmPGKp-/acZ{9.3), phBAp-/acZ<9.3), and pHPCAG-/acZ(10.4) 
were individually supertransfected into 5x1 0 8 MG1.19 ES ceils. The resulting 
numbers of hygromycin B resistant colonies were scored after 8 days. Transfection 
efficiencies are normalised relative PGKhphALT20. 

Figure 5 shows expression of /?-galactosldase in MG1.19 transfectants. Primary 
colonies were stained with Xgal after 8 days of selection. 

(A) shows typical homogeneous staining pattern obtained following supertransfection 
with supercoiied pHPCAG-/acZ. 

(B) shows heterogeneous staining pattern obtained in minority of clones following 
supertransfection with supercoiied pHPCAG-/acZ. 

(C) shows heterogeneous staining pattern typically observed following electroporation 
of linearized pHPCAG-/acZ and stable integration. 

(D) shows rare faint staining pattern obtained after supertransfection with supercoiied 
pHPCAG-/acZ. 

Figure 6 shows the restriction pattern of plasmid DNAs recovered from pHPCAG-/acZ 
supertransfectant clone. 

A supertransfectant MG1.19 clone carrying pHPCAG-/acZ was cultured for 60 days 
in the presence of hygromycin B. Hirt DNA was then prepared and 
eiectrotransformed into E.coli DH10B cells. Plasmid DNAs were recovered from 
transformants, digested with EcoRI, resolved by electrophoresis on 1 .0% agarose gei 
and visualised by ethidium bromide staining. Expected fragment sizes: pMGD20neo, 
4852bp and 2884bp; pHHPCAG-/aeZ, 3697bp, 2810bp, 783bp and 397bp. Lane 1: 
size marker (1Mb iadder:BRL); lane 2: control pMGD20; lane 3 : control pHPCAG- 
/acZ; lane 4; recovered pMGD20; lane 5,6: recovered pHPCAG-teeZ. 
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Figure 7 shows induction of differentiation by expression of STAT3F in MG 1.19 ES 
ceifs, 

(A) shows proportion of differentiated colonies in LIF-supplemented medium restating 
from supertransfection of STAT3, antisense STATS and STAT3F expression vectors. 
Colonies were fixed and stained with Leishman's reagent after 8 days selection and 
numbers of stem cell colonies and differentiated colonies scored. 

(B) shows marker gene expression in STAT3F supertransfectants: Expression of 
marker genes in pools of MG 1.1 9 cells supertransfected with STAT3 (lane 1), STAT3 
antisense {lane 2) and STAT3F (lane 3) expression vectors. Total RNA was prepared 
after 8 days of selection in UF-supplemented medium and 5/jg aliquots analyzed by 
filter hybridisation with j?-globin, Rex-1 , H19 and G3PDH probes. The jJ-globin probe 
detects alt transgene mRNA species generated from pHPCAG, including an 
alternativeiy spliced product from the antisense construct. 

(C) shows photomicrographs of representative colonies 8 days after supertransfection 
with (i) STAT3, (ii) STAT3F, and (Hi) empty expression vectors and selection in the 
presence of LIF, or, (iv) induction of differentiation by culture in the absence of LIF 
for 8 days. 

Figure 8 shows co-supertransfection of STAT3F with wild type STAT expression 
vectors. Proportions of undifferentiated stem cell colonies generated after co- 
supertransfection of MG1.19 ES cells with 10>g pBPCAGGS-STAT3F plus 10/ig 
pHPCAG vector containing staffer (control), STAT3, STAT1 or STAT4 inserts. After 
8 days selection with 8Gpg/ml of hygromycin B plus 20//g/ml of blasticidin S, colonies 
were fixed and stained with Leishman's reagent. 
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EXAMPLE 1 
Materials and Methods 
Vector constructions. 

Standard recombinant DNA methods were used to construct all ptasmids(10) . 
Plasmid pHPCAG (Fig 1) was constructed from pMGD20neo{8) . The PGKneopolyA 
sequence was replaced by a hygromycin resistance marker, PGKhphpA, and large 
T sequences were deleted (see Results). A Sail-Seal fragment containing the CAG 
expression unit, a SsfXI stuffer sequence, the polyA addition signal derived from the 
rabbit £-globin gene and an SV40 replication origin (11) was inserted. Coding 
sequences for /?-galactosidase, LIF or interleukin-2 were introduced between the 
BstX! sites. 

For construction of episomal expression vectors with alternative promoters, the Sail- 
Xba\ fragment containing the CAG expression unit in pHPCAG-/acZ was replaced with 
the 344 bp SV40 enhancer/promoter (SV40e/p), the 466 bp human 0-actin promoter 
(hBA), the 502 bp mouse phosphoglycerate kinase promoter (mPGK) and the 90 bp 
HSV-tk minimal promoter (tk), resulting in pHPSV40e/p-/acZ, pHPhBA-/acZ, 
pHPmPGK-tecZ and pHPtk~/acZ, respectively. 

Episomal vectors with alternative selection markers were constructed by replacing the 
PGKhphpA cassette in pHPCAG with the SVbsrpA cassette carrying the E.coli 
btasticidin S deaminase (b$r) gene derived from pSV26sr (Waken Seiyaku) or the 
hCMVzeopA cassette carrying the Streptoalloteichus bleomycin resistant gene {Sh 
ble) derived from pZeoSV (Invitrogen) to generate pBPCAGGS and pZPCAGGS, 
respectively. 

Cell culture and transfection, 

MG1.19 ES cells are derivatives of the CCE tine which stably maintain around 20 
episomal copies of pMGDneo(8) , They were maintained on gelatin-coated plates in 
Glasgow modified Eagle's medium (GMEM. Gibco-BRL) supplemented with 10% fetal 
calf serum, 0.1 mM £-mercaptoethanol, non-essential amino acids, 200 //g/mi G418, 
and 100U/ml LIF produced in COS-7 celis{11,12) , For supertransfection, routinely, 
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5x1 0 6 MG1.19 cells were suspended in 800 pi of PBS, incubated with 20 fig of 
supercoiied vector DNA for 10 min on ice, and electroporated at 200V/960//F using 
a Bio-Rad gene pulser. Celfs were transferred into geiatinized plates and allowed to 
recover overnight before addition of appropriate selection agent. Histochemica! 
staining for jS-galactostdase was carried out with 5-bromo-4-chloro-3-tndoiy! /3-D- 
gafactopyranoside (X-gal) (13) , and £-ga!actosidase activity was measured by 
incubation of cell extracts with o-nitrophenyl-/?-D-galaetopyranostde (ONPG). 
Differentiation was induced in monolayer culture as described (12) . 

Analysis of episomal vectors in the supertransfectants. 

Hirt supernatants were prepared as described (14) . For amplification of recovered 
episomal vectors, electrocompetent E. coli DH10B cells were transformed by 
electropo ration at 2500V/25//F/200 1 /! 

Results 

Construction of an episomai expression vector 

Potyoma-based plasmids have recently been reported to be competent for episomal 
propagation in ES ceils (8) . The plasmid pMGD20neo contains a modified iarge T 
expression unit called LT20, the viral origin of replication (Ori), and the PGKneopA 
cassette as a selectable marker. This plasmid can be maintained as an 
extrachromosomal element in wild-type ES cells. It can be modified to include a 
cDNA expression unit (9) . However, the low frequency of conventional stable 
transfection of ES cells (A 1 x 10" 5 ) remains a limiting feature. Furthermore, episomal 
propagation oniy occurs in 10-15% of primary transfectants (8,9) . 

A second piasmid has been described which can be maintained as an episome only 
in ES celts which independently express the large T protein (8) . This piasmid, 
PGKhphALT20, contains LT20 with a large deletion in its coding sequence, Ori, and 
PGKftpnpA as a selectable marker. When introduced into a ceil tine such as MG1 .19, 
in which episomal maintenance of pMGDneo has already been established, the yieid 
of hygromycin B resistant stable transfectants is extremely high. This phenomenon 
of supertransfection is presumed to arise from the p re-existence of large T protein in 



WO 98/32868 



PCT/GB9S/00216 



- 20 - 

the recipient cells. 

In the studies reported below the modification and use of supertransfection vectors 
for cDNA expression is characterised. 

Size of vector 

PGK/jpftALT2G retains part of the large T coding sequence. We made a series of 
deletions in the ALT20 sequence to minimize the vector size and thereby increase 
the capacity for inserts and reduce potential bias in the construction and screening 
of cDNA libraries. The supertransfection efficiency of four derivative piasmids was 
then compared in MG1.19 cells. All showed comparable supertransfection efficiency 
to PGKnp/)ALT20 (data not shown). The smallest, pLT20A Woe I np/?. has a deletion 
of 2953 bp, yielding an episomal vector backbone of only 4.7kb. 

Expression unit 

into this minimal episomal vector we introduced a cDNA expression unit 
Transcriptional initiation signals are supplied by the CAG cassette(11) , which 
comprises the human cytomegalovirus immediate early enhancer, a 1kb fragment of 
the chicken /?-actin gene (promoter, non-coding first exon and first intron), and a 
splice acceptor derived from the rabbit /?-giobin gene. This combination has been 
shown to direct strong expression of cDNAs in undifferentiated stem cells. The 
resulting expression vector, pHPCAG (Fig 1), contains the CAG sequences followed 
by the BsfXI stuffer sequence derived from pCDM8 as a cDNA cloning site, and a 
poiyA addition signal derived from the rabbit /?-giobin gene. In addition the piasmid 
contains the PGKfrpnpA (15) cassette for hygromycin seiection of ES cell 
transfectants, the poiyomaOn with pyF 101 -derived mutant enhancer element (1 6) for 
stable episomal replication in cells expressing polyoma iarge T protein, and the fi- 
lactamase (amp) gene and prokaryotic replication origin for amplification in E, coll. 
The SV40 Or/ is also present to allow for transient episomal replication in mammalian 
host cells expressing SV40 iargeT, such as COS ceils (17) . 
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Characterization of supertransfection. 

The parameters of supertransfection with pHPCAG and derivatives were investigated. 
First, 5x1 0 6 MG1.19 ceils were electroporated with various amount of supercoiied 
pHPCAG. selected in medium containing 80 pg/ml of hygromycin B for 8 days, and 
the number of stem ceil colonies scored after Leishman's staining(12) . Although the 
highest efficiency per pg DNA was observed with minimum amounts (1-2 /ig) of 
vector DMA (Fig. 2B), the total yield of hygromycin B resistant colonies increased with 
increasing amount of piasmid (Fig 2A). Saturation was not reached over the range 
of plasmid concentrations tested. With 100 //g piasmid DMA, 150,000 hygromycin B- 
resistant colonies were obtained, representing 3% of total treated ceils. Disablement 
for episomai replication by linearisation of pHPCAG prior to electroporation reduced 
this transfection efficiency to less than 0.01%. 

Next, increasing numbers of MG 1.19 ceils were subjected to electroporation with 1 00 
Ijq of pHPCAG DNA. Comparable stable transfection efficiencies in the range 3-6% 
were obtained with up to 2.5x1 0 7 celis. 

The copy number of pHPCAG in the supertransfectants was analyzed by preparation 
of Hirt supernatants followed by filter hybridisation. This analysts revealed that 
supertransfected cells carried approximately 20 copies each of pMGDneo and 
pHPCAG (Fig. 3). 

These data demonstrate that the efficiency of supertransfection with pHPCAG is 
extremely high. However, episomai vectors can be limited in their capacity for inserts 
because increased size may cause inefficient replication or instability. To investigate 
this issue in the ES cell system, episomai vectors of different size were 
supertransfected into MG 1.19 cells. The numbers of supertransfectant colonies were 
scored and plotted against vector size (Fig. 4). These data indicate that there is a 
progressive reduction in transfection efficiency with increasing piasmid size, in 
particular, the largest piasmid tested, a derivative of pHPCAG with a 3kb lacZ insert 
(total size 10.4kb) showed a 50% reduction in colony number. However, that this 
may not be due entirely to the size of the piasmid because the very high levels of/?- 
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galactosidase expression may exert some toxic effects (see below). 



laeZ expression in supertransfeetants. 

To evaluate the ievel and pattern of expression of transgenes from pHPCAG, the 
E.coli /?-gaiactosidase (/acZ) gene was introduced into this vector. The resulting 
vector, pHPCAG-/acZ, was introduced into MG1.19 ceils and supertransfeetants 
isolated by selection with 80 pg/ml of hygromycin B for 8 days. The number of 
colonies isolated was 50% of the number obtained in a parallel supertransfection with 
pHPCAG (see above). The colonies were smaller and many of the cells showed an 
abnormal spindle-shaped morphology. These effects were not observed with several 
other inserts in pHPCAG and are suggestive of a toxic effect of the high level lacZ 
expression. The primary supertransfeetants were stained with X-ga! and the staining 
pattern examined under phase-contrast microscopy. Staining was detectable after 
5 minutes incubation and was intense by 1 hour. This ievel of ^-galactosidase 
activity is significantly higher than we have observed from a variety of integrated 
expression constructs. 

Approximately 80% of supertransfectant colonies showed ubiquitous expression 
(>90% cell positive) as shown in Fig.5-A (i). Of the remainder, 15% showed 
heterogeneous expression {Fig.5-A (ii)}, and 5% showed little or no staining (Fig.5-A 
(iv)}. The latter two classes are likely to arise as a result of vector integration which 
occurs in up to 20% of supertransfeetants (8). in transfectants derived by 
electroporation of linearized pHPCAG-fecZ into MG1.19 cells (which results in vector 
integration in the majority of clones), only 15 % of colonies showed homogeneous 
staining whereas 70% of colonies stained heterogeneously (Fig.5-A (iii)), and 15% 
showed no expression. 

Analysis of expanded clones from each class of transfectant established that this 
difference in expression characteristics was stable. Twelve of 13 expanded 
supertransfeetants expressed lacZ homogeneously. In contrast, only 4 out of 24 
clones derived using linearized vector showed homogeneous expression. This is 
consistent with our previous observations on integrated expression constructs in ES 
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ceils, in fact the CAG unit gives a significantly higher frequency of colonies which 
show stable ubiquitous expression than other promoters we have examined. 

The difference in staining pattern between episomafiy maintained and integrated 
vectors indicates that the former escape modifying influences arising from integration 
and reliably give full activity of the expression unit 

Comparison of expression with various promoters on episomal vector. 

An ability reliably to generate predetermined levels of expression would be a 
important attribute for a transgene expression system. The previous observations 
suggested that episomal vectors offered potential to achieve unmodified expression. 
Various promoters with different strengths in undifferentiated stem cells were 
therefore introduced into the episomal vector by replacing the CAG expression unit 
of pHPCAG-/acZ. Expression of the lacZ reporter was then assayed in both transient 
and stable supertransfectants {Table 1). The relative ratio of /?-galactosidase activity 
obtained from the SV40 enhancer/promoter complex, the human /S-actin promoter, 
the mouse PGK-1 promoter and the HSV-tk minimal promoter in transient transfectant 
was retained in stabie supertransfectants. The CAG expression unit showed 
strongest activity in the tested constructs in both transient and stable transfectants. 
In this case, however, the relative ratio in transient transfectants, 1 9 times higher than 
SV40, was significantly reduced in stable transfectants. This may arise from an 
elimination of strong expressants due to a toxic effect of high lacZ expression (see 
above). A reduced number of supertransfectants and smaller size of colonies was 
observed only with the CAG vector. 

Stability of supertransfected episomal expression vector during long-term 
culture and differentiation of host cells. 

A critical limitation of previously described episomal vectors is their instability during 
long-term culture. Many episomal vectors undergo integration into the host genome 
after long-term culture, resulting in a reduction in expression and inability to recover 
transgenes by preparing Hirt supernatants. To test the stability of the 
supertransfection system, four pHPCAG-/acZ supertransfectant clones were cultured 
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for 60 days (approximately 90 generations) under continuous selection with 80 pg/ml 
of hygromycin B. Three of the four clones maintained relatively constant ievefs of fS- 
galactosidase activity determined by ONPG assay and uniform expression as 
revealed by Xgal staining. The fourth clone showed unstable and variegated 
expression, as commonly observed on vector integration. Hirt supernatants were 
prepared from one of the stably expressing clones at the end of the 60 day culture 
period. Filter hybridization analysis of the Hirt DNA indicated that the ES cells carried 
approximately 20 copies of pMGD20 and 5 copies of pHPCAG-/acZ per cell (data not 
shown). The lower copy number of pHPCAG-/acZ may be due to its larger size 
and/or the toxic effect of strong fecZ expression. The Hirt DNA was transformed into 
E.coli for further analysis. Of the bacterial transformants, 20% carried pHPCAG-/acZ 
and the remainder carried pMGDneo20, in good agreement with the hybridization 
data. Restriction mapping showed no evidence of rearrangement in either piasmid 
(Figure 6). 

In the experiment above, cells were maintained under selection with hygromycin B. 
In the absence of selection pressure, supertransfectant clones lost expression of 0- 
galactosidase over several passages in culture. This might indicate an intrinsic 
instability of supertransfected episomal vectors. However, it could also reflect a 
selective disadvantage for ES cells which express high levels of ^-galactosidase. It 
is noteworthy in this regard that the primary episome, pMGD20neo, is stable In the 
absence of seiection(S) . 

Stability of expression from pHPCAG-/acZ during the in vitro differentiation of ES ceils 
was also analyzed. Differentiation was induced in three ways: withdrawal of LIF; 
exposure to reiinoic acid; and treatment with 3-methoxybenzamide(18) . After 6 days 
the differentiated progeny stained ubiquitously in all three cases (data not shown). 

These data indicate that supertransfected episomal vectors can be maintained in an 
extrachromosomai state and direct strong expression of transgenes during long-term 
self-renewal and differentiation in vitro. 
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Production and secretion of the cytokine LIF from an epfsomal ES cell 
expression vector. 

The pHPCAG-tecZ plasmsd can efficiently direct strong and homogeneous expression 
of the cytoplasmic tacZ reporter gene. We next investigated expression of a secreted 
moiecute, the cytokine LIF. LIF is an essential supplement to ES cell culture medium 
because it inhibits differentiation of the stem cells (19,20) . Expression of LIF can 
readily be assayed by formation of stem cell colonies in media tacking the cytokine. 

Episomal vectors for expression of another cytokine, interleukin-2 (which has no 
effect on ES cell phenotype), and for LIF were electroporated in parallel into MG1.19 
cells. The ceils were seeded at low density (1 .5x1 0 4 and 5x1 0 3 cells per 90mm plate) 
to avoid the rescue effect which arises from the production of LIF by differentiated ES 
cell progeny (21) , and cultured with 80pg/ml of hygromycin B for 8 days. pHPCAG- 
H2 generated large numbers of stem cell colonies in medium supplemented with LIF, 
but none in the absence of LIF. pHPCAG-tff in contrast produced comparable 
numbers of healthy stem cell colonies in both the presence and absence of 
exogenous LIF (Table 2). These colonies couid be expanded and propagated without 
LIF-supplementation of the medium. These data confirm previous observations that 
increased autocrine expression of LIF renders ES cells factor-independent (22) and 
establish that secreted proteins are produced efficiently and stably by this episomal 
expression system. 

Co-supertransfection of episomal vectors. 

Introduction of two or more different transgenes into cells is often required for 
analysis of protein interactions and/or co-operative function. The poor efficiency of 
homogeneous expression in conventional transfectants is a major obstacle for such 
investigations in ES ceils. To test the possibility that the episomal approach couid 
be applied to co-express multiple cDNAs, we constructed episomal expression 
vectors with different selection markers. Co-supertransfection of episomal vectors 
was then assessed. 



The basic episomal expression vector pHPCAG carries the hygromycin 
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phosphotransferase gene driven by mouse PGK-1 promoter (PGKhphpA). We 
prepared episoma! vectors which carry the zeocin-resistance gene driven by the 
human cytomegalovirus immediate-early promoter (pZPCAG), or the blasticidin S- 
resistance gene driven by the SV40 enhancer/promoter (pBPCAG) by substitution 
of the PGKhphpA cassette in pHPCAG. These vectors were supertransfected into 
MG1.19 ceils followed by 8 days selection with the appropriate antibiotic. 
Comparison of the numbers of resulting drug-resistant coionies (Table 3) revealed 
that these selection systems are slightly less efficient than hygromycin B selection but 
nonetheless enable large numbers of supertransfectants to be isolated. 

ES cells harbouring two different episomal vectors can be isolated by repeated 
supertransfection. Supertransfectants carrying pHPCAG can be transfected again 
with pBPCAG or pZPCAG, with comparable efficiency to the original supertransfection 
into MG1 .19 ES cells (data not shown). This should allow establishment of efficient 
screens for assaying functional interactions between gene products. 

The effects of co-electoporation of supertransfection vectors were also investigated, 
pHPCAG (10 jug) and pBPCAG {10 //g) were co-electroporated into 5x10 s MG1.19 
cells. Cells were selected in hygromycin B or blasticidin S only, or both, for 8 days 
and the number of drug-resistant coionies scored in each case. The numbers of 
hygromycin or blasticidin S single-resistant colonies were 39,000 and 13,000, 
respectively, while the number of double-resistant coionies was 1,200. Thus the 
apparent efficiency of incorporation of both plasmids was less than 10%. Similar 
results were obtained on co-supertransfection of pHPCAG and pZPCAG (not shown). 
These data suggest that the majority of supertransfectants incorporate only one 
plasmid under these electroporation conditions. This is significant for application of 
the episomal system to functional cDNA library screening. 

EXAMPLE 2 

The effects of overexpression of a large number of transgenes in ES cells were 
investigated by construction of vectors based on pHPCAG and including a DNA insert 
coding for the transgene being investigated. 5 x 10 6 ES MG1.19 cells were 
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supertransfected with 20 fig of expression vectors and selected with 80 jt/g/ml of 
hygromycin 8 for 8 days. The numbers of drug-resistant colonies were counted and 
normalised relative to numbers obtained with empty vector. The results are shown in 
Table 4. 

EXAMPLE 3 

inhibition of STATS activation blocks seif-renewal and promotes differentiation 

To assess directly the requirement for STAT3 activation in ES eel! self-renewal, we 
exploited a dominant interfering mutant form of STAT3, STAT3F. In this mutant 
(Minami et a/., 1996), the tyrosine residue at amino acid position 705 is mutated to 
phenylalanine. Phosphorylation of Tyr705 is required for dimerization and nuclear 
translocation. When expressed at high level, STAT3F has been shown to block the 
activation of endogenous STAT3 in various cell types, possibly by titrating out 
receptor docking sites (Fukada ef at., 1996; Minami ef a!., 1996; Nakajima et at., 
1996; Sonni ef a/., 1997; lhara et a/., 1997). 

Using conventional transfection approaches we were unable to recover ES cell 
transfectants showing stable high level expression of STAT3F. In parallel 
experiments, however, transfection of the UF-independent embryonal carcinoma cell 
line P19 yielded multiple expressing clones. This suggested that blockade of STAT3 
activation in ES cells specifically resulted in cell death, growth arrest or differentiation. 
The transfection and expression strategy of the invention was therefore adopted to 
enable characterisation of the consequences of STAT3F expression. 

The STAT3F mutant cDNA was introduced into the supertransfection vector pHPCAG. 
The wild type STAT3 coding sequence was aiso introduced, in both sense and 
antisense orientations. The three constructs were electroporated into MG1.19 cells 
which harbour a large T expression pfasmid and can be supertransfected with 
constructs containing the polyoma origin (Gassmann ef a/., 1995). Supertransfectants 
were isolated by selection in hygromycin B for 8 days in the presence of LIF. 
Colonies were fixed, stained with Leishman's reagent, counted, and scored for the 
presence of stem cells and differentiated cells. More than 95% of colonies obtained 
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following supertransfection with control or wild type STAT3 vector were stem cell 
colonies (Figure 7A). A modest increase in the proportion of differentiated colonies 
was obtained with the antisense construct. The STAT3F vector, however, yielded 
predominantly differentiated colonies. A decrease in total number of colonies was also 
observed after supertransfection with STAT3F. This may reflect an early onset of 
differentiation which would produce very small clones that would not be scored. 
Alternatively, very high levels of STAT3F expression may also be toxic, though this 
has not been reported in other cell types. Morphologically, the differentiated STATS F 
colonies closely resembled the differentiated colonies generated on culture of ES 
cells in the absence of LIF (Figure 7C). Various other cDNAs have been expressed 
in ES ceils using this system, with little or no effect on differentiation (data not 
shown). This suggested that the effect on differentiation was specifically attributable 
to expression of STAT3F. 

The differentiation induced by expression of STAT3F was examined further by 
expression analysis of the marker genes rexl and H19. Rex-1 mRNA, which is 
specifically expressed in undifferentiated stem cells, was down regulated in STAT3F 
supertransfectants. in contrast, H19 RNA which is found at low levels in stem cells 
but is upregulated during differentiation, was increased (Figure 7B). A similar pattern 
of gene regulation is observed during differentiation of ES cells induced by withdrawal 
of LIF. These data confirm that the morphological differentiation triggered by STAT3F 
is accompanied by reprogramming of gene expression, 

STAT3F was also expressed from the mouse phosphoglycerate kinase (pgk-1) 
promoter in the episomat vector pHPPGK. This vector gives at least 10-fold lower 
expression than pHPCAG (data not shown), in this case, there was no significant 
effect on either colony number or differentiation status of MG1 .19 supertransfectants. 
A critical level of expression of the dominant interfering mutant therefore appears 
necessary to biock self-renewal. 
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Effect of STAT3F on self-renewal is suppressed bv co-expression of STAT3 

To test whether the induction of differentiation by expression of STAT3F was due to 
an inhibition of endogenous STAT3 activity, we attempted to rescue the stem cell 
phenotype by co-expression of wi!d type STAT3 and also of STAT1 and STAT4. A 
STAT3F expression vector carrying a blasticidln resistance marker was co- 
supertransfected into MGt.19 ceils with episomal constructs for expression of wild 
type STATs and hygromycin resistance. Co-supertransfectants were isolated in 
medium containing both 20/ig/ml of biasttcidin S and 80/ig/mi of hygromycin B. The 
numbers of stem cell and differentiated colonies were scored after 8 days. As shown 
in Figure 8, only co-expression of wild type STAT3 restored self-renewal in the 
presence of STAT3F. Transfection with STAT1 or STAT4 constructs alone had no 
effect on self-renewal in the absence of STAT3F {not shown) and did not alter 
differentiation induced by STAT3F. In the case of supertransfection with the CAG 
promoter STAT1 construct, the total number of colonies (stem + differentiated) 
recovered was reduced but the relative proportion of stem cell colonies versus 
differentiated cells was unaltered. This occurred in both the presence and absence 
of co-expression of STAT3F, and suggests that high level expression of STAT1 may 
be toxic to ES cells. By using the mouse PGK-1 promoter to drive lower levels of 
expression comparable numbers of colonies were recovered on transfection with the 
STAT1 as with the other constructs. In this case, again only the STAT3 construct 
showed any restoration of stem celi colonies, although to a tower degree than with 
the high expression CAG vector (not shown). These data indicate that STAT3 has a 
specific function in ES cells which cannot be compensated by STAT1 or STAT4. 

EXAMPLE 4 

The invention is also used in a strategy for direct selection of genes that code for 
secreted and eel! surface proteins. In one example of this strategy, the basic cioning 
vector is a truncated form of IL6R that lacks a signal sequence. This vector is 
described in detail below and shown in Fig. 11. If this truncated IL6R is expressed 
in ES ceils, it is not exported to the cell surface and these cells differentiate when 
cultured in !L6. However, if the 1L6R signal sequence is reconstituted by a signal 
sequence provided by a cDNA fragments cloned in frame at the 5' end of the 
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truncated 1L6R, the chimaeric receptor is expressed on the surface of ES cells. ES 
cells containing such chimaeric receptors are thus maintained as undifferentiated 
colonies when cultured in IL6. 

Libraries of short, 5' cDNA fragments are produced and cloned into a truncated and 
modified IL6R-based expression vector. ES cells transformed with such libraries 
express cDNA:IL6R fusion proteins. However, only eONAs that encode signal 
sequences confer IL6 responsiveness on ES ceils. These cDNAs alone give rise to 
undifferentiated, proliferating ES eel! clones. This strategy therefore provides a direct 
selection for cDNAs encoding secreted and cell surface proteins. 

The chimaeric IL6R is expressed in the episomal expression system described above 
(or a derivative thereof). This allows drug selection for episomally transformed cells 
and high level expression of cloned DNA. 

To further refine the selection system, ES cells are modified with two targeted 
mutations: 

a) A selectable marker gene, for example the blastictdin resistance gene, is 
introduced into the OCT-4 locus by standard targeting techniques. Since Oct-4 is 
expressed in undifferentiated ES cells, the blastictdin resistance gene will be 
expressed only by undifferentiated colonies. Blasticidin selection therefore is used to 
decrease background growth by ensuring rapid deletion of differentiating, Oct-4 
negative, ES cells. 

b) Since ES cells can produce LIF as an autocrine growth factor, ES cells are used 
in which both copies of the LIFR gene have been disrupted by gene targeting. This 
eliminates the possibility of LIF -dependent, false positive colonies that might 
otherwise persist throughout selection in IL6. 



Details of vector construction: 

1), IL6R was cloned into the episomal vector pCAGSP or a derivative (pCAGiPXN, 
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i.e. pCAGlP with a destroyed Notl site). pCAGiP contains an internal ribosome entry 
site (IRES) and a puromycin resistance gene downstream of its multiple cloning site, 
resulting in stoichiometric production of cDNA;IL6R fusion proteins in transfected cells 
under puromycin selection. IL6R in pCAGIP provides a positive control (IL6- 
responsive functional protein on the cell surface), and the basis of the new vector. 

2) . To construct the cloning vector, IL6R cDNA was truncated by cleavage with 
BssHli at nucleotide number 92. This deleted the initiator ATG and sequences 
encoding the signal sequence. 

3) . To minimise potential steric interference by cloned proteins with IL6 binding and 
1L6R function, DNA encoding a synthetic flexible linker peptide was then added to the 
5' end of the truncated 1L6R. Two alternative linkers have been used: gly gly gly gly 
ser gly gly gly gly ser and a linker containing the FLAG epitope, gly ser ASP TYR 
LYS ASP ASP ASP ASP LYS (FLAG epitope in upper case). The sequence of these 
linkers is shown in Fig. 9. In each case, the linker sequence has been cloned in 
frame with IL6R and has two unique cloning sites (Xhol and Notl) at its 5' end, 
allowing the introduction of cDNA libraries, or specific cloned sequences, in a 
directional manner. The FLAG epitope is recognised by a commercially available 
monoclonal antibody (M2; available from IBS/Kodak) regardless of its position within 
a fusion protein, and will thus allow the expression levels of surface protein to be 
measured directly by irnmunocyiochemistry. 

4) . Vectors containing each of these linkers and an upstream signal sequence are 
tested for relative expression level and IL6R-function, as detailed below. 

To test the utility of these vectors for selecting proteins expressed at the cell surface, 
a number of known signal sequences are cloned into each vector. These are tested 
for surface expression and SL6R function. Signal sequences include those from rat 
CD4 (a protein with extracellular Ig domains), mouse sek (a receptor tyrosine kinase, 
with no extracellular Ig domains) and mouse sonic hedgehog {a secreted factor). 
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ES cells are transfected with vectors bearing candidate signal sequences by 
fipofectton or electroporatson, followed by puromycin selection for transfected ceils. 
After overnight growth in the presence of L1F, to maintain the undifferentiated state 
and proliferation, transfected cells are spfit into three groups and treated with either 
1) LiF, 2) [L6 or 3) neither growth factor. Only ceils bearing IL6R brought to the ceil 
surface by a fused signal peptide will proliferate in the presence of IL6. Positive 
controls include ES cells transfected with wiid-type 1L6R grown in the absence of LIF 
and the presence of IL6. Negative controls include empty vector (i.e truncated 1L6R 
with no 5' insert) grown in the presence of 1L6. To determine whether fusion proteins 
N-ierminal to !L6R block signalling (by steric hindrance), the proportion of such cells 
that express surface protein but fail to proliferate in response to iL6 is deduced by 
comparing the number of ceils expressing the FLAG epitope with the number that 
give rise to colonies. 

Vectors defined by this assay are then used in cDNA library screens. Preferably, 
sequences corresponding to 5' ends of cDNAs are generated from full length cDNA 
libraries and dtrectionally cloned in the screening vector. 

In the above description scientific publications are referred to under the following 
reference numbers: 

1 . Smith, A. G. (1992) Seminars in Cell Biology, 3, 385-399. 

2. Evans, M. J. and Kaufman, M. {1981) Nature, 292, 154-156. 

3. Martin, G. R. (1981) Proc. Natl. Acad. Sci. USA, 78, 7634-7638. 

4. Doetschman, T. C., Etstetter, H., Kate, M., Schmidt, W. and Kemler, R. (1985) 
J.EmbryoI.Exp.Morphol., 87, 27-45. 

5. Weiss, M. J. and Orkin, S. H. (1996) J. Clin. Invest, 97, 591-595. 

6. Bradley, A., Evans, M. J., Kaufman, M. H. and Robertson, E. (1984) Nature, 
309, 255-256. 

7. Beddington, R. S. P. and Robertson, E. J. (1989) Development, 105, 733-737. 

8. Gassmann, M„ Donoho, G., Berg, P. (1995) Proc. Natl. Acad. Sci.USA, 92, 
1292-1296. 

9. Camenisch, G„ Gruber, M„ Donoho, G., Van Sloun, P., Wenger, R. and 
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Gassmann, M. (1996) Nucleic Acids Research, 24, 3707-3713. 

10. Sambrook, J., Fritsch, E, F. and Maniatis, T. (eds.) (1989) Molecular Cloning: 
A Laboratory Manual, 2nd ed Ed. 3 vols. Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, New York. 

11. Ntwa, H., Yamamura, K.-L, Miyazaki, J.-i. (1991) Gene, 108, 193-200. 

12. Smith, A. G. (1991) J. Tiss. Cult Meth., 13, 89-94. 

13. Beddington, R. S. P., Morgenstem, J., Land, H. and Hogan, A. (1989) 
Development, 106, 37-46. 

14. Hirt, B. J. (1969) J. Mol, Biol., 26, 141-144. 

15. te Riele, H., Maandag, E. R„ Cfarke, A., Hooper, M. and Berns, A. (1990) 
Nature, 348, 649-651. 

16. Fujimura, F. K„ Deininger, P. L., Friedmann, T. and Linney, E. (1981) Celt, 23, 
809-814, 

17. Tsui, L. C,, Breitman, M. L., Siminovitch, L and et al, (1982) Cell, 309, 499- 
508. 

18. Smith, A. G. and Rathjen, P. D. (1991) Sem.Dev.BroL, 2, 317-327. 

19. Smith, A. G., Heath, J. K„ Donaldson, D. D., Wong, G. G„ Moreau, J., Stahl, 
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We have thus described the development of an optimised transfection and expression 
system which will enable high throughput functional screening of cDNAs in 
piuripotential mouse embryonic stem (ES) ceils and differentiated derivatives. The 
strategy is based on extrachromosomal vector replication driven by expression of 
polyoma large T protein. When a vector containing a polyoma origin of replication 
is introduced into an ES cell line that harbours polyoma large T antigen, a high 
frequency of stable secondary transfection results. This process is referred to as 
supertransfection. Supertransfected plasmids can be maintained eptsomaliy during 
long-term culture and during differentiation in vitro. Expression of a /?-gaIactosidase 
reporter from an episomai vector is both ubiquitous and stable, in contrast to the 
variegated and unstable expression usually observed after cDNA integration into the 
ES cell genome. Moreover, in the absence of integration, promoter strength is 
predictable and a range of expression levels can reliably be achieved by using 
different elements. We also show that episomai vectors can be used for efficient 
expression of both cytosolic and secreted proteins. These features should make this 
system invaluable for functional analyses of defined cDNAs and for direct expression 
screening of cDNA pools or libraries in ES cells. 
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Table 1. Comparison of £-galactosidase activities directed by various promoters in 
transient and stable supertransfectants. 



Promoter Relative 0-gal activity 

transient stable 

SV40e/p 1.0 1.0 

h/?Ap 1.1 0.7 

mPGKp 0.5 0.5 

TKp 0.1 0.1 

CAG 19.0 18 



5x1 0 6 MG1.19 ES cells were supertransfected with 20//g of vector DNAs. After 3 
days culture for transient expression assay or 8 days selection with hygromycin B for 
stable expression assay, the /?-gaiactosidase activity generated by these constructs 
was measured by ONPG assay. Results are normalised relative to activity generated 
by the SV40e/p construct. See 'Materials and methods' for construction details of 
vectors. 
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Table 2. Supertransfection of LIF and tL-2 expression vectors into MG1.19 
ES cells. 



Vector LIF in medium No, of hyg r stem ceil colonies 

pHPCAG-//f + 42,000 

pHPCAG-//f - 38,000 

pHPCAG-Z/2 + 48,000 

pHPCAG-//2 - 0 

5x1 0 6 MG1.19 ES cells were supertransfected with 20*/g of vector DNAs, After 8 
days selection with 80//g/ml of hygromycin B in the presence or absence of LIF, the 
number of stem cell colonies were scored. 



WO 98/32868 



37 - 



PCT/GB98/00216 



Table 3. Efficiency of supertransfection of vectors with various selection markers. 

Selection marker Drug for selection f//g/ml) No. of resistant colonies 

PGKhphpA hygromycin B (80) 50,000 

SVesrpA blasticidin S (4) 12,600 

hCMVzeopA zeocin (20) 20,600 

5x1 0 6 MG1.19 ES cells were supertransfected with 20//g of vector DNAs of episomal 
vectors, pBPCAG and pZPCAG, which carry bsr and zeo resistance genes 
respectively. After 8 days selection with the appropriate drug, the number of drug- 
resistant stern cell colonies were scored. 
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Table 4. Effects of overexpression of transgenes in ES cells using pHPCAG. 



cDNA Relative number of Colony Size and 

hygro R colonies Morphology 



None 


1.00 


Normal 


iacZ 


0.64 


small 


DIA/LIF 


0.87 


slightly small 


IL-2 


0.92 


slightly small 


Rex-1 


0.88 


Norma! 


Fgf-2 


0.65 


Normal 


Fgf-4 


0.82 


Normal 


Fgf-5 


0.41 


Normal 


Oct-1 


0.17 


small 


Oci-2 


0.65 


slightly small 


Oct-3/4 


0.61 


differentiated 


Oct-6 


0.03 


some differentiation 


c-jun 


0.47 


small 


El A 


0.08 


differentiated 


Jak2 K/E 


0.75 


Normal 


bcI-2 


0.28 


small, spindle morphology 


MAPKP 


1.38 


Norma! 


RXRa 


0.20 


some differentiation 


RXR£ 


0.63 


Norma! 


RXRy 


0.91 


Normal 


COUP-TF1 


0.40 


some differentiation 


HNF-4 


0.05 


Normal 


Statl 


0.10 


small 


Stat3 


0.52 


Normal 


Stat4 


0.16 


Normal 


Stat3DON* 


0.14 


differentiated 



5x10 s ES MG 1.1 9 ceils were supertransfected with 20 jig of expression vectors and selected 
with 80 (ig/mi of hygromycin B for 8 days. The numbers of drug-resistant colonies were 
counted and normalised relative to numbers obtained with empty vector. 
StatSDON is the dominant interfering mutant form of Stat3 described by Akira et al, (1996). 
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Claims 

1 . A method of expressing a DNA in a cell, comprising; 

(a) (i) transfecting the cell with a first vector that expresses a 

replication factor; or 
(ii) otherwise obtaining a cell that expresses or will express the 
replication factor; 

and 

(b) transfecting the ceil with a second vector, wherein 

{i} the second vector contains a DNA, or is adapted to receive a 
DNA, in operative combination with a promoter for expression of 
the DNA; and 

(ii) extrachromosomal replication of the second vector is dependant 
upon presence within the cell of the replication factor. 

2. A method according to Claim 1 wherein the replication factor is a viral 
replication factor. 

3. A method according to claim 1 or 2 wherein the viral replication factor is 
selected from polyoma large T antigen, EBNA-1 antigen, papilloma virus 
replication factors, SV40 large T antigen and functional variants, analogues 
and derivatives thereof appropriate to the cell species. 

4. A method according to any of claims 1-3 wherein the second vector does not 
express the replication factor. 

5. A method according to any of claims 1-4 wherein the second vector expresses 
a selectable marker. 

6. A method according to any of claims 1-5 further comprising transfecting the 
ceil with a third vector, wherein the third vector contains a DNA, or is adapted 
to receive a DNA, in operative combination with a promoter for expression of 
the DNA, and replication of the third vector is dependant upon presence within 



WO 98/32868 

- 40 - 

the cell of the replication factor. 
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7. A method according to Claim 6 wherein the third vector expresses a selectable 
marker, which selectable marker is different to that expressed by the second 
vector. 

8. A method according to any preceding claim wherein the cell is a mammalian 
cell or an avian cell. 

9. A method according to any preceding claim wherein the cell is an embryonic 
cell. 

10. A method according to Claim 9 wherein the cell is an ES, EC or EG cell. 

11. A method according to any preceding claim for transfection of an ES cell 
wherein the ES cell of step (a) expresses polyoma large T antigen and the 
second vector comprises a natural target for polyoma large T antigen , such as 
On or functional variants thereof adapted to bind to polyoma large T antigen. 

12. A method according to any preceding claim wherein the DNA codes for a 
polypeptide or protein. 

13. A method according to any of Claims 1-11 wherein the DNA codes for an 
antisense RNA, 

14. A method according to any preceding claims wherein the promoter is inducible. 

15. A method according to any preceding claim wherein transcription of the DNA 
can be activated by a site specific recombinase. 

16. A method according to any preceding claim wherein replication of the second 
vector can be prevented by a site specific recombinase. 
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17. A vector for transfection of a celi, wherein: 

(i) the vector contains a DNA, or is adapted to receive a DNA, in operative 
combination with a promoter for expression of the DNA; 

(is) extrachromosomal replication of the vector is dependant upon presence 
within the ceil of a replication factor; and 

(iii) the vector does not express the replication factor. 

18. A vector according to Claim 17 wherein the replication factor is a viral 
replication factor. 

19. A vector according to Claim 17 or 18 wherein the viral replication factor is 
selected from polyoma large T antigen, EBNA-1 antigen, papilloma virus 
replication factors, SV40 large T antigen and functional variants, analogues 
and derivatives thereof. 

20. A vector according to any of Claims 17 to 19 wherein the vector is 
substantially free of DNA coding for the replication factor or any part thereof. 

21 . A vector according to any of Claims 1 7 to 20 for transfection of mammalian or 
avian cells. 

22. A vector according to any of Claims 17 to 21 for transfection of ES ceils. 

23. A vector according to Claim 22 comprising a natural target for polyoma large 
T antigen, such as Ori or functional variants thereof adapted to bind to 
polyoma large T antigen. 

24. A vector according to any of Claims 17-23 wherein the DNA codes for a 
polypeptide or protein. 

25. A vector according to any of Claims 17-23 wherein the DNA codes for an 
antisense DNA. 
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26. A vector according to any of Claims 17-25 wherein the promoter is inducible. 



27. A vector according to any of Claims 17 to 26 wherein the vector comprises a 
sequence coding for a selectable marker. 

28. Use of a vector according to any of Claims 17-27 for expression of a DNA 
sequence within a ceil. 

29. A ceil transfected with a first vector that expresses a replication factor and with 
a second vector according to any of Claims 17 to 27. 

30. A mammalian cell according to Claim 29. 

31 . An embryonic cell according to Claim 29. 

32. A cell selected from an ES, EC or EG cell according to any of Claims 29 to 31 , 
and differentiated progeny thereof. 

33. An assay for the effect of presence in a cell of a protein or polypeptide or other 
product of DNA expression, comprising the steps: 

(a) (i) transfecting the ceil with a first vector that expresses a 

replication factor; or 
(it) otherwise obtaining a cell that expresses or will express the 
replication factor; 

(b) transfecting the cell with a second vector, wherein 

(i) the second vector contains a DNA coding for the protein or 
polypeptide or other product of DNA expression in operative 
combination with a promoter for expression of the DNA; 

(ii) the second vector also contains a DNA coding for a selectable 
marker in operative combination with a promoter for expression 
of the selectable marker; and 

{iii} extrachromosomal replication of the second vector is dependant 
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upon presence within the celi of the replication factor; 

(c) selecting for celis that have been transfected with the second vector; 
and 

(d) maintaining the selected cells over a plurality of generations so as to 
assay the effect of expression of the protein or polypeptide or other 
product of DNA expression. 

34. An assay according to Claim 33 wherein step (a) is carried out once and the 
cells obtained are divided and used for a plurality of separate assays in which 
steps (b)-(d) are carried out a plurality of times with second vectors containing 
different DNA sequences. 

35. An assay according to Claim 33 or 34 for assay of the effect of presence in the 
cell of two factors, each factor being independently selected from a protein, a 
polypeptide and another product of DNA expression. 

36. A method of screening a library of cDNAs comprising assaying the effect of 
expression of each of the cDNAs according to the method of any of Claims 33 
to 35. 

37. A method of investigating the properties of a DNA sequence comprising 
expressing in a cell a composite DNA including (a) the DNA sequence under 
investigation, linked to (b) a DNA coding for a ceil active protein, wherein 

activity of the celi active protein is dependant upon transport of the eel! active 
protein to the ceil surface, and 

the DNA of (b) does not code for a polypeptide capable of directing 
transportation of the celi active protein to the cell surface. 



38. 



A method according to Claim 37 for screening a library of DNAs to identify 
DNA sequences coding for signal polypeptide sequences that transport 
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proteins to the cell surface, and the method optionally comprises determining 
whether the eel! active protein is transported to the cei! surface and remains 
there or is secreted by the cell, 

39. A method according to Claim 37 or 38 wherein the DNA of (b) is obtained by 
deleting or disabling, from a DNA encoding a cell surface or secreted protein, 
that portion of the DNA that codes for the polypeptide sequence responsible 
for transportation of the protein to the cell surface. 

40. A method according to any of Claims 37 to 39 wherein the ceil active protein 
induces a morphological or proliferative change in the cell. 

41 . A method according to any of Claims 37 to 40 wherein the ceil active protein 
inhibits differentiation of the cell and in the absence of the cei! active protein 
the celi wili differentiate, 

42. A method according to any of Claims 37 to 41 wherein the eel! active protein 
is a celi surface receptor. 

43. A method according to Claim 42 wherein the cell active protein is an IL-6 
receptor and the DNA of (b) encodes a modified form of the receptor 
preprotein lacking a functional signal sequence. 

44. A method according to any of Claims 37 to 43 comprising investigating the 
properties of a DNA in mammalian or avian cells. 

45. A method according to any of Claims 37 to 44 comprising investigating the 
properties of a DNA in embryonic cells. 



46. 



A method according to Claim 45 comprising investigating the properties of a 
DNA in ES, EC or EG ceils or differentiated progeny of such cells, 
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47. A method according to any of Claims 37 to 46 comprising expressing the 
composite DNA by: 

(a) (i) transfecting the celi with a first vector that expresses a 

replication factor; or 
(ii) otherwise obtaining a cell that expresses or will express the 
replication factor; 

(b) transfecting the ceil with a second vector, wherein 

(i) the second vector contains the composite DNA in operative 
combination with a promoter for expression of the composite 
DNA; 

(ii) the second vector also contains a DNA coding for a selectable 
marker in operative combination with a promoter for expression 
of the selectable marker; and 

(iii) extrachromosomal replication of the second vector is dependant 
upon presence within the cell of the replication factor; 

(c) selecting for cells that have been transfected with the second vector; 
and 

(d) maintaining the selected cells over a plurality of generations so as to 
assay the effect of expression of the composite DNA. 

48. A method according to claim 47 wherein step (a) is carried out once and the 
cells obtained are divided and used for a plurality of separate methods in 
which steps (b)-(d) are carried out a plurality of times with second vectors 
containing different DNA sequences. 

49. A method according to any of Claims 37 to 48 for identification of a DNA 
coding for a cell surface or secreted protein. 

50. A method according to any of Claims 37 to 48 for identification of a cei! 
surface or secreted protein. 
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FIG. 9 



SEQUENCE OF LINKER OLIGONUCLEOTIDES. 



a) FLAG linker 

FLAG epitope 

Xhol Not I Gly Ser ASP TYR LYS ASP ASP ASP 

CTA GA C TCG AGT AGC GGC CGC GGC AGC GAC TAC AAG GAC GAC GAC 

BssHII 

ASP LYS Gly Ser Cys Arg Ala 
GAC AAG GGG AGC TGC CGC GCG C 



b) tgly 4 ser] 3 linker 

Xhol NotI Gly Gly Gly Gly Ser Gly Gly Gly 

CTA G AC TCG AG T AGC GGC CGC GGA GGC GGA GGA AGC GGA GGA GGA 

BssHII 

Gly Ser Cys Arg Ala 
GGG AGC TGC CGC GCG C 



RECTIFIED SHEET (RULE AD 
tSA/EP 



WO 98/32868 



FCT/GB98/00216 



9/10 



FIG. 10 



SEQUENCE OF TRUNCATED AND MODI FIED IL6R. 

a) PLAGdeitaIL6R 

TCTAGACTCGAGTAGCGGCCGCGGC^GC^ 

GGCAAATGGCACAGTGACAAGCCTGCCA^GGCCACCGT^^ 

TGTTACCATTCACTGGGTGTACTCTCKCT^ 

GGTGGATGTTCCCCCAGAGGAGCCCAAGCTCTCCTGCTTCCGGAAGAACCCCCTTCTCAACGCCATCTGTGAGTG 
GCSTCCaaGCAGCACCOCCTCTTC^ 

TGACAAAGTATACCACATAC^GTCACTGTGCGTTGCAAACAGTGTGGGAAGCAAGTCCAGCCACAACGAAGCGTT 

T GAC AG CTTAAAAATGGTGC AGCCGGATC CACCTGCCAAC CTTGTGGTATC AGC CATACCTGGAAGG C CGCGCTG 

GCTCAAAGTCAGCTGGCAGCACCCTGAGACCTGGGACCCGAGTTACTACTTGCTGCAGTTCCAGCTTCGATACCG 

ACCTGTATGGTCAAAGGAGTTCACGGTGTTGCTGCTCCCGGTGGCCCAGTACCAATGCGTC^TCCATGATGCCTT 

GCGAGGAGTGAAGCACGTGGTCCAGGTCCGTGGGAAGGAGGAGGTTGACCTTGGCCAGTGGAGTGAATGGTCCCC 

AGAGGTCACGGGCACTCCTTGGATAGCAGAGCCCAGGACCACCCCGGCAGGAATCCTCTGGAACCCCACACAGGT 

(H'CTGTTGAAGACTCTGCCIAACCACGAGGATCAGTACGAAAGTTCT^ 

GCAAGAATCCTCGTCCATGTCCCTGCCCACATTCCT^ 

ACCCCCACCGTATTCCTTGC^CCCACTGAAGCCGA^ 

aTCTGACMWACCGTAAACCACAGCTOC 

CTACTTATTCCCCAGATAA 

b) [gly*ser],dettaIL6R 

TCTAGACTCGAGTAGCGGCCGCGGAGGCGGAGGAAGCGGAGGAGGAGGGAGCTGCCGCGCGCTGGAGGTGGCAAA 
TGGCACAGTGAC^GCCTGCCAGGGGCCACCGTTACTC^ 

cattc^ctck3GTGTactctggctcacaa^ 

GCAGCTCAGCGACACTCX3GGACTATTTATGCTCCCT^ 



CTTCCAGGTGCCCTGCCAGTATTCTC^ 
AGTATMCACATAGTGTCRCTGTGOT 

CTTAAAAATGGTGCAGCCGGATCCACCTGCCAACCTrGTGGTATC AG CCATAC CTGGAAGGCCGCGCTGGCTCAA 

AGTCAGCTGGCAGCACCCTC^GACCTGGa^CCCGAGTTACTACTTGCXGCAGTTCCAGCTTCGATACC 

ATGGTCAAAGGAGTTCAOGGTGTTGCTG^rrCCGGGTGGCCCAGTACCAATGCG^ 

AGTGAAGCACGTGGTCCAGGTCCGTGGGAAGGAaGAGCTTGACCTTGGCCAGTGGAGTGAATGGTC 

CACGGGCACTCCTTGC^TAGC^GAGCC^ 

TGAAGACTCTGCCAACCACGAGGATCAGTACGAAAGTTCTACAG 

ATCCTCGTCCATGTCCCTGCCCACATTCCTGG^^ 

CATCATCCTGAGACTCAAGCAGAAATGGAAGTCAGAGGCTGAGAAGGAAAGCAAGACGACCTCTCCTCCACCCCC 
ACCGTATTCCTTGGGCCCACTGAAGCCGACC^^ 

CAAXACCGTAAACCACAGCTGCCTGGGTGTCAGGGACGCACAGAGCCCTTATGACAACAGCAACAGAGACTACTT 
ATTCCCCAGATAA 
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